Sales and Advertising of Lettuce


1 


FRANK MEISSNER 
San Jose State College 


For several years California growers promoted nationally a new 
brand of lettuce, using as many as eight media. Ratings of two of 
these media, along with local retail price and temperature, were 
observed to be closely correlated with per capita sales in 22 cities. 


Two businessmen were talking. 

“Is your advertising getting results?” in- 
quired the first. 

“It sure is!” moaned the second. “Last 
week we advertised for a night watchman 
and the next night we were robbed.” 


THE LONG-RANGE PURPOSE of advertising and
sales promotion is to persuade consumers to
buy more of a commodity at the same price, or to
pay higher prices for the same quantity, or to do a
little of each. In the economist’s language, advertising
effect can be gauged in terms of shifts
either in the position or the slope of the demand
curve for the product (see Figure 1). This theoretical
framework has been reviewed elsewhere (Fox,
1959; Hoos, 1959; and Waugh, 1959).

Conceptually, the measurement of advertising 
impact appears to be a straightforward task. How- 
ever, in reality it is quite a difficult job, because ad- 
vertising is only one of many “demand shifters.” 


¹Drafts of this paper were read by Thomas A. Mainwaring,
senior economist, Stanford Research Institute; Walter J. Englund,
marketing economist, Bureau of Markets, California
Department of Agriculture; and Karl V. Christierson, vice
president, Major Farms, Salinas, California. Their comments
were greatly appreciated. Sole responsibility for the
contents of the paper remains the present author’s.
Changes in income, taste, habits, prices and quantities
of competing products are some other important
factors that affect demand.

This unsatisfactory state of the art is clearly re- 
flected in the views expressed by the majority of 
respondents in a recent survey of executives in 152 
manufacturing firms made by the National Indus- 
trial Conference Board (Mathes and Thompson, 
1960). Their gloomy attitudes were confirmed by 
participants in a recent workshop on “measuring 
advertising effectiveness” held by the Northern 
California Chapter of the American Marketing Association.
“The consensus of the group was: ‘You
can't do it.’ There are too many factors involved to
[permit] realistic research. . . .” (Newsletter, Northern
California Chapter of AMA, 1960, p. 1).

But it would appear that the great number of
variables is by no means a fatal complication. After
all, even a reasonably straightforward trend analysis
incorporates in the model, almost as a matter of
routine, several variables such as those mentioned above.
The parameters of demand models are being estimated
by means of such simple and well known
methods as least squares regression and correlation,
or perhaps by the more sophisticated tools of econometric
analysis using systems of simultaneous equations.


Applications 


In recent years several attempts have been made
to apply these analytical tools to the measurement
of advertising impact on sales of food and farm
products. The slow going seems due far more to
inadequate data on advertising and other inputs
than to the inherent methodological difficulty
of the problem.

For example, one of the most mathematically
elegant analyses (Baseman, 1955) found that tobacco
consumption was more responsive to prices than
to advertising. A least squares demand function
showed that during the 20 years 1929-1950, a one
per cent increase in industry-wide advertising expenditure
led to an average one-twentieth of a per
cent increase in tobacco consumption, while a one
per cent decrease in prices boosted consumption by
one-fifth of a per cent. Also estimated were demand and marginal
utility elasticities by the method of simultaneous
equations. No attempt was made to show
how manipulations of an individual firm’s advertising
budget would have influenced the demand
for the firm’s own brand or brands.


PURPOSE

This paper has two objectives: first to review an
advertising evaluation study of lettuce which used
the multiple regression method of analysis; and
second to point out the direction which future research
might take.

Lettuce is used for illustrative purposes primarily
because past work was readily available (Mainwaring
et al., 1955). Furthermore, lettuce is a relatively
“easy” commodity for which impact of producer-sponsored
advertising can be isolated.


Production 

The great bulk of commercial production is con- 
centrated in rather well defined areas. Thus in 
1954-58 the national production of lettuce averaged 
about 1,630,000 tons annually, of which about 
990,000 tons, or almost two-thirds, were produced 
in California. 

California lettuce is marketed during two seasons
from two major growing districts. The Imperial
Valley marketing season is from November
through March, the Salinas season from April
through November. The Salinas district in the Central
Coast area ships the largest volume: about two-thirds
of all California lettuce or nearly one-half
of the total rail shipments of lettuce in the United
States. From April through November shipments
from the Salinas district dominate the U. S. lettuce
trade. However, in recent years shipments from the
Salt River Valley area in Arizona have been “cutting
into” Salinas lettuce shipments, particularly
during November and April (see Figure 2).


Consumption 
In the last four decades lettuce has developed
into one of the most important vegetables. In 1954-1958,
per capita consumption was approximately I!
pounds, or almost four times higher than in the
immediate post-World War I years. Lettuce’s share of the
total value of vegetables produced for the fresh
market increased from about ten per cent in 1920
to roughly 17 per cent in the most recent five years.
In 1958, the farm value of lettuce marketed in the
United States amounted to about $150 million.

This remarkable growth in lettuce consumption
is due to a combination of several favorable factors:
changing dietary habits resulting in emphasis
on non-fats; increased levels of living which allow
consumers to buy foods once considered luxuries;
fewer hours of hard manual labor which reduced
need for high energy foods; increased population;
new lettuce varieties; and technological innovations
which have made an assembly line operation out of
lettuce growing: field packing, vacuum cooling and
faster shipping.

Figure 2: AVERAGE WEEKLY SHIPMENTS OF LETTUCE (HOPKIN, 1960)

The per capita consumption of lettuce will grow 
further within the next decades for two reasons. 
First, many low-income consumers still eat little 
lettuce; as per capita income continues to grow and 
become more evenly distributed, more people will 
eat more lettuce. Second, institutional outlets (ho- 
tels, restaurants, schools, hospitals) are becoming 
increasingly important. On the average, relatively 
more lettuce is eaten in these places than in meals 
eaten at home. 

Combinations of these favorable factors will 
probably boost the per capita consumption of let- 
tuce to 22-25 pounds by 1975. A successful adver- 
tising and sales promotion campaign would tend to 
accelerate this upward trend, and might therefore 
show a “plus” effect. This type of evidence is more
convincing than if advertising tends to slow down 
a declining trend. 


The C-7 Campaign²

In 1952 a group of lettuce shippers of Salinas,
California formed Lettuce, Inc., a non-profit corporation,
in order to increase the consumption of
lettuce by nationally advertising and promoting
their C-7 brand (C for California, and 7 for seven
days a week). In attempting to achieve this goal,
part of the program was directed toward influencing
members of the produce trade to merchandise
lettuce more effectively by: considering lettuce as
the most basic commodity in their fresh produce
line; using lettuce for promotional purposes; prepackaging
lettuce; and participating in the C-7
program by handling C-7 lettuce, using C-7 advertising
and point-of-purchase materials, and using
C-7 trademarked cellophane wrappers for prepackaging
C-7 lettuce.

From 1952 to 1957 the C-7 campaign consisted
partly of spot or network radio advertising. Television
time on the “Home” program was bought in
1955. In late 1955 and in 1956 the “Mickey Mouse
Club” was used. Promotion efforts were carried out 
by fieldmen from Lettuce, Inc., who solicited sup- 
port for C-7 at wholesale as well as retail levels in 
major population centers of the United States; 
their work was supplemented by direct mailings. 

²This section of the paper was discussed at the 120th annual
meeting of the American Statistical Association at Stanford
University, August 26, 1960, by H. S. Houthakker and F.
Meissner.
Point-of-sale promotion materials were made available,
together with prepackaging wrappers with the
C-7 label imprint. Newspaper advertising mats were
also supplied to retailers.

The sustained campaign was handled by John
Cohan Advertising in Salinas. The expenditures for
the campaign increased from about $150,000 in
1952 to about $420,000 in 1956.

In 1956 the Board of Directors of Lettuce, Inc. 
commissioned Stanford Research Institute to eval- 
uate the effect of their promotional campaign upon 
lettuce consumption. A proviso was issued to “let
the chips fall where they may.”


METHOD 

The impact of the campaign was evaluated by 
two methods. 

First, a series of interview surveys was made with 
lettuce brokers, wholesalers, retailers and consumers 
in two cities which differed primarily in the 
amount of C-7 advertising and promotion received 
per capita: Grand Rapids, Michigan, which re- 
ceived relatively much attention, and Butte, Mon- 
tana, which received little attention. F.o.b. buyers of 
lettuce in the Salinas area of California were also 
interviewed. 

Second, a multiple regression analysis was carried
out for 22 cities: Atlanta, Baltimore, Boston,
Chicago, Cincinnati, Cleveland, Dallas, Denver, De- 
troit, Fort Worth, Jackson (Mississippi), Kansas 
City (Missouri), Los Angeles, New Orleans, New 
York, Oakland, Philadelphia, Pittsburgh, St. Louis, 
San Francisco, Seattle and Washington, D. C. This 
was done in an attempt to explain the city-to-city 
variation in per capita consumption of lettuce (de- 
pendent variable) in terms of various demand 
shifters (independent variables). 

In other words, it was assumed that fluctuations 
in per capita consumption of lettuce in a certain 
marketing area can be explained by the following 
functional relationship: 


L = f(I, T, P, A, . . .)


This symbolic expression is shorthand for the following
idea: per capita consumption of lettuce (L)
in a certain marketing area depends primarily on
(is a function of) the area’s per capita income (I),
temperature (T), retail price of lettuce (P) and the
intensity of the advertising effort (A). Other variables
that cannot readily be quantified are indicated
by the dots.
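
To make the functional relationship concrete, the sketch below fits a linear version of L = f(I, T, P, A) by ordinary least squares. It is only an illustration: the linear form, the coefficient values in `true_b`, the value ranges and the 104 synthetic observations are assumptions made for this example, not the Stanford Research Institute data or program.

```python
import random

def ols(rows, y):
    """Least-squares coefficients from the normal equations (X'X) b = X'y,
    solved by Gauss-Jordan elimination with partial pivoting."""
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * v for r, v in zip(rows, y)) for i in range(k)]
    aug = [xtx[i] + [xty[i]] for i in range(k)]
    for col in range(k):
        pivot = max(range(col, k), key=lambda row: abs(aug[row][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(k):
            if row != col:
                factor = aug[row][col] / aug[col][col]
                aug[row] = [a - factor * b for a, b in zip(aug[row], aug[col])]
    return [aug[i][k] / aug[i][i] for i in range(k)]

# Hypothetical "true" coefficients: intercept, income, temperature, price, advertising.
true_b = [0.20, 0.15, -0.10, -1.00, 0.30]

rng = random.Random(0)  # fixed seed so the synthetic data are reproducible
rows = []
for _ in range(104):
    income = rng.uniform(0.10, 0.20)       # I, in $10,000 per capita
    temperature = rng.uniform(0.55, 0.90)  # T, in 100 degrees Fahrenheit
    price = rng.uniform(0.10, 0.17)        # P, in dollars per head
    advertising = rng.uniform(0.0, 0.10)   # A, a single made-up intensity index
    rows.append([1.0, income, temperature, price, advertising])

# Noise-free consumption, so the fit should recover true_b almost exactly.
consumption = [sum(b * x for b, x in zip(true_b, r)) for r in rows]
estimates = ols(rows, consumption)
print([round(b, 4) for b in estimates])
```

With real data the dependent variable would contain unexplained variation, and the estimates would carry sampling error that has to be judged with significance tests.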
TABLE 1

FACTORS ASSOCIATED WITH VARIATION IN PER CAPITA
CONSUMPTION OF HEAD LETTUCE, 22 SELECTED U.S. CITIES, 1950-55
(SOURCE: STANFORD RESEARCH INSTITUTE)

                                                                    Coefficients
                                                               Partial
Variables             Units                          Mean    Correlation   Regression
Independent:
 1. Income            $10,000 per capita per year    .1493      .129          .1789
 2. Price             Dollars per head of lettuce    .1345     -.595*       -1.1159
 3. Temperature       100° Fahrenheit                .7962     -.228*        -.1321
 4. Radio spot        100’s of broadcasts            .0161     -.057         -.0512
 5. Radio network     100’s of broadcasts            .1183     -.154         -.0591
 6. Television        100’s of broadcasts            .0212      .120          415
 7. Fieldmen          Ratings of effectiveness       .0346     -.201*        -.2905
 8. Wholesalers       No. per 100,000 persons        .0380      .091          .0428
 9. Retailers         No. per 1,000 persons          .0276      .120          .1838
10. Direct mail       No. per 1,000 persons          .0048      .152         1.2109
11. Newspapers        Ratings of cooperation         .0203      .237*         .4578
Dependent:
    Unloads of lettuce  Carlots per 100 persons      .1107

* Significant beyond the .10 level of confidence.

1. Income: Effective Buying Income from “Survey of Buying Power,” Sales Management magazine. The
original data were adjusted to a 1947-49 constant dollar basis by the annual average consumer price index
for the respective city, or for the nearest comparable city.

2. Price: Monthly averages from Retail Food Prices by Cities, Bureau of Labor Statistics, U.S. Department
of Commerce. The Consumer Price Index, Series B-10 of the U.S. Department of Commerce provides
comparable figures. The 1947-49 deflated monthly prices were weighted by monthly unloads to provide
annual averages.

3. Temperature: Averages of daily maxima, Local Climatological Summary, U.S. Weather Bureau.

4-5. Radio: Number (in hundreds) of spot and national network broadcasts.

6. Television: Number (in hundreds) of network broadcasts of the “Home” and “Mickey Mouse” programs.

7. Fieldmen: Client rating: .10 (maximum), .07 (moderate), .03 (very little) and 0 (no contact).

8. Wholesalers: Number of service wholesalers who actively supported the C-7 program.

9. Retailers: Number of retailers who used promotions in their stores and ran newspaper ads featuring
C-7 lettuce.

10. Direct Mail: Number of names on the C-7 mailing list. Approximately six bulletins were sent annually,
announcing special events or asking for support of the C-7 program.

11. Newspapers: Client rating of cooperation received: .10 (maximum), .07 (moderate), .03 (very little) and
0 (none).

The population variable was eliminated by expressing most data on a per capita basis. Radio and TV advertising
broadcasts were retained in original terms, because the media coverage was proportional to population.
Data on monetary expenditures on individual media were not available. One single expression of the sales
promotion and advertising effort could not be developed. Time was excluded, because the partial correlation
coefficient corresponding to the time variable was only .052, or insignificantly small. All calculations
were carried out with a standard program on the IBM 650 electronic computer.


RESULTS

Results of the regression analysis are shown in
Table 1. Shown for all variables are: units of
measurement, means of observations, partial correlation
coefficients and regression coefficients.

The partial correlation coefficient indicates the
“closeness of association” between an individual
explanatory variable and lettuce consumption,
after adjustments have been made for effects of
other explanatory variables. The regression coefficient
indicates the association between a unit
change in the explanatory variable and lettuce consumption.
Minus signs in front of the coefficients
indicate a negative relation between the independent
and dependent variables. For instance, everything
else being equal, an increase of one per cent
in the retail price of lettuce led to a decrease of
about one per cent in lettuce unloads.

It appears that in 1950-55 only four variables
(price, temperature, fieldmen and newspaper advertising)
had a significant influence on lettuce
consumption. Of the remaining six advertising
variables, it cannot be said with confidence that
they had a net effect on lettuce consumption. This
does not mean that the C-7 program as a whole had
no effect. On the contrary: the multiple correlation
coefficient between per capita lettuce consumption
and all 11 explanatory variables was .66, but
merely .60 if only price, income and weather were
included. This means that the chances are better
than 1:19 that the eight advertising variables had a
significant joint effect.

Specifically, it appears that at prevailing prices 
the C-7 program tended to increase demand for 
head lettuce by an annual equivalent of about 8.8 
carlots per 100,000 population. (The sum of the 
products of individual regression coefficients and 
their means came out to about seven carlots per 
100,000 persons per year. This estimate was in- 
creased by 27 per cent to allow for the fact that 22 
out of the 104 individual observations referred to 
years 1950 and 1951, when the C-7 program was 
not in effect.) 
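
The 27 per cent adjustment can be reproduced with simple proportions. The sketch below assumes the pooled estimate was scaled up by the ratio of all observations to those from years in which the program operated; that reading is an inference from the figures quoted above, and the variable names are illustrative only.

```python
total_obs = 104        # city-year observations used in the regression
pre_program_obs = 22   # observations from 1950 and 1951, before C-7 began
pooled_effect = 7.0    # "about seven" carlots per 100,000 persons per year

# Scale factor: all observations relative to program-year observations.
scale = total_obs / (total_obs - pre_program_obs)
adjusted_effect = pooled_effect * scale

print(round(scale, 3))            # 1.268, i.e. an increase of about 27 per cent
print(round(adjusted_effect, 1))  # 8.9 when starting from a rounded 7.0
```

Starting from the rounded figure of seven carlots gives 8.9 rather than the reported 8.8, which suggests the unrounded sum of products was slightly below seven.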

Instead of a single equation approach, a simul- 
taneous equations technique might have been more 
appropriate. But in view of the relative crudeness 
of the original data inputs, it did not appear that 
the results would have been sufficiently more re- 
liable to warrant the extra time and effort re- 
quired by this rather sophisticated analytical tool. 

The C-7 campaign was the only national adver- 
tising activity for lettuce carried out at that time. 
Thus no adjustment for competitive promotions by 
lettuce producers in other areas was necessary. 

The results of the statistical analysis were con- 
sistent with the findings of the interview surveys, 
which showed that: 


1. Among consumers in areas reached by the campaign,
an awareness of the C-7 campaign and demand
for C-7 lettuce was created.

2. The campaign influenced wholesalers and re- 
tailers to increase their selling effort on lettuce in 
general, and on C-7 lettuce in particular. 


A similar interview survey has also been done for
asparagus (Mason, 1959). It showed that current advertising
and promotional efforts have stimulated
sales, yet trade members have no real desire to
push asparagus without industry aid. While proceeding
toward these conclusions, the study, commissioned
by the California Asparagus Growers,
unearthed extensive information on methods of
promotion, merchandising and handling, and on
the competitive position of California asparagus. It
also provides model interview techniques for generating
comprehensive marketing and distribution
data.


DISCUSSION 
An examination of the above analysis points toward
interesting areas of future explorations.


Other Variables 

The independent variables used in the multiple
regression analysis explained only about half of the
total variation in the per capita consumption of
lettuce: 38 per cent by price, income and weather
and an additional 12 per cent by the eight advertising
variables (see Table 2). It is therefore evident
that important factors which affect consumption of
lettuce were left out of the analysis. An attempt
should be made to include additional variables
that would narrow down the unexplained or residual
part of the variation.


TABLE 2

ANALYSIS OF VARIANCE IN PER CAPITA
CONSUMPTION OF LETTUCE

                                Percent
                                of Total     Degrees of    Mean       F
Source of Variance              Variation¹   Freedom       Squares    Ratio
Non-advertising (variables
  1-3 in Table 1)                 38.0         100           -
Advertising (variables
  4-11 in Table 1)                11.8           8          .0148      2.69
Total explained variation         49.8          92
Total unexplained variation       50.2          92          .0055

Source: Stanford Research Institute

¹The partial correlation coefficients, which in Table 1 are
adjusted for degrees of freedom, were converted back to unadjusted
form, and then squared.
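
The figures in Table 2 can be cross-checked with a few lines of arithmetic. The sketch below recomputes the F ratio for the advertising variables and, assuming the usual degrees-of-freedom adjustment for a correlation coefficient, shows one way the explained shares of 49.8 and 38.0 per cent reconcile with the multiple correlation coefficients of .66 and .60 reported in the text. The adjustment formula and the function name `adjusted_r` are assumptions for illustration; the source does not state which formula was used.

```python
n_obs = 104    # city-year observations
k_all = 11     # all explanatory variables in Table 1
k_base = 3     # income, price and temperature only

r2_all = 0.498   # share of variation explained by all 11 variables
r2_base = 0.380  # share explained by the three non-advertising variables

def adjusted_r(r2, n, k):
    """Multiple correlation coefficient adjusted for degrees of freedom
    (assumed formula: 1 - (1 - R^2)(n - 1)/(n - k - 1), then square root)."""
    adjusted_r2 = 1 - (1 - r2) * (n - 1) / (n - k - 1)
    return adjusted_r2 ** 0.5

# F ratio for the eight advertising variables, computed two ways.
f_from_mean_squares = 0.0148 / 0.0055
f_from_shares = ((r2_all - r2_base) / (k_all - k_base)) / (
    (1 - r2_all) / (n_obs - k_all - 1)
)

print(round(adjusted_r(r2_all, n_obs, k_all), 2))    # 0.66
print(round(adjusted_r(r2_base, n_obs, k_base), 2))  # 0.6
print(round(f_from_mean_squares, 2))                 # 2.69
print(round(f_from_shares, 2))                       # 2.7
```

The small gap between the two F values reflects rounding in the published percentages and mean squares.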


The Returns Curve 
Lettuce growers and shippers, often the same
person or firm, advertise and promote so as to increase
net revenue per crate f.o.b. shipping point
and to broaden the market for lettuce. (An expression
for f.o.b. shipping point prices would have
been helpful in supplementing the retail prices
used in the original Stanford Research Institute
study.) In other words, they want to know either
how best to allocate a given advertising and promotion
expenditure among geographic areas and media
so as to maximize sales, or how to minimize advertising
and promotion effort while achieving a
given volume of sales.

For this purpose one needs to estimate the shape 
and place of the “returns curve” for each area, 
which would show “the expected net value of in- 
creased sales due to promotion.” This principle was 
suggested more than two decades ago by Dr. Harry
R. Wellman (1939), now vice president of the University
of California. Wellman originally used the
term “sales curve” to mean the relationship between
the expected volume of sales and varying
levels of promotional expenditures in a marketing 
area. Waugh (1959) prefers “returns curve.” 

The returns curve principle is illustrated in Fig- 
ure 3. Conceptually the three curves might repre- 
sent the expected increases in the net returns for 
lettuce in, say, three of the 22 marketing areas for 
which consumption data are available. All curves 
tend to flatten off; this is to indicate that there 
probably would be decreasing returns to promotion.

Figure 3: RETURNS FROM 3 KINDS OF PROMOTION (horizontal axis: expenditures)

Here is the functioning of this model as ex- 
plained by Waugh (1959, pp. 211-212): 


Suppose that the initial allocation of promotional funds
is that shown by the circles. At these three points, the middle
curve is flatter (less steep) than either of the others. The
slopes of the curves measure marginal returns. In other
words, at the initial allocation of promotional funds, the marginal
returns from market 2 are less than those from either
market 1 or market 3. Suppose that a dollar of promotional
funds were transferred from market 2 to market 1. The loss
in net returns from market 2 would be more than offset by
the gain from market 1. The same is true of a transfer of a
dollar of promotional funds from market 2 to market 3.
Such transfers of funds would always be profitable so long
as the marginal returns from any pair of markets were unequal.
Thus, the equalization of marginal net returns is a
necessary condition for maximum net income.

One could shift funds and observe graphically the slopes
of the several curves. In time, the slopes of all would be
equal, that is, where tangents to the curves were parallel. In
Figure 3, such a situation is shown by the squares, corresponding
to expenditures e1, e2, e3 in markets 1, 2, and 3.
Assuming that the three curves accurately measure the responses
to advertising in three markets, and assuming a promotional
budget of (e1 + e2 + e3) dollars, the most profitable
allocation of that budget is e1 to market 1, e2 to market 2
and e3 to market 3.

No one can successfully dispute this principle. It is a geometric
fact, like the Pythagorean theorem about right triangles.
It is true: exactly, inescapably, eternally. The only
question is how to estimate the returns curves. This must be
done, in one way or another, if the problem of optimum
allocation is to be solved. (Italics added, F.M.)


Chances are that the sets of multiple regression 
analyses (demand curves) discussed previously will 
provide reasonable estimates of the returns curve 
for lettuce. 
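
The equal-marginal-returns rule quoted from Waugh can be illustrated with a small numerical sketch. Everything specific in it is hypothetical: the square-root form of the returns curve, the three scale factors and the budget are chosen only so that the curves flatten off as in Figure 3; they are not estimates for lettuce.

```python
import math

def returns(scale, spend):
    """Hypothetical concave returns curve: net return from spending `spend`."""
    return scale * math.sqrt(spend)

def allocate(budget, scales, step=1.0):
    """Hand out the budget one step at a time, always to the market whose
    returns would rise most from the next step (its highest marginal return)."""
    spend = [0.0] * len(scales)
    for _ in range(int(budget / step)):
        gains = [returns(a, e + step) - returns(a, e) for a, e in zip(scales, spend)]
        best = gains.index(max(gains))
        spend[best] += step
    return spend

scales = [3.0, 2.0, 4.0]   # made-up responsiveness of markets 1, 2 and 3
budget = 2900.0            # made-up promotional budget in dollars

spend = allocate(budget, scales)
# Slope of each returns curve at the final allocation.
marginal = [a / (2 * math.sqrt(e)) for a, e in zip(scales, spend)]

print(spend)
print([round(m, 3) for m in marginal])
```

For these assumed curves the allocation settles where the three slopes are equal, which is the condition Waugh describes; moving a dollar between any two markets would then lower total returns.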

Series of regression analyses have already been 
successfully used in an attempt to explain the fluc- 
tuations in f.o.b. shipper prices for western lettuce.
Shipments from different production regions in the 
west, as well as from competing areas, and con- 
sumer income were used as explanatory variables 
(McGlothlin, 1957; Brennan and Toussaint, 1958). 


The Decay Curve 

The C-7 campaign was discontinued in 1957. 
This fact alone opens up new possibilities for empirical
testing of the concept of the decay curve
(also called distributed lags; see Friedman, 1957, 
and Nerlove, 1958) and its application to measur- 
ing the effect of advertising and sales promotion. 

Past work in this field has primarily focused on 
current relations between advertising and sales. 
This type of analysis is appropriate for evaluation 
of short time promotions such as weekend specials 
in food stores. However, it seems that most adver- 
tising that has been sustained over a long period of 
time does not show a significant effect till some 
time after the campaign has started. By the same 
token, it has some effect for months and perhaps 
years after the campaign ceased. In fact, recent 
studies (Alderson, 1958; Vidale and Wolfe, 1957) 
indicate that “old ads never die; they just fade 
away.” 
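
One simple way to picture such a build-up and fade-out is a geometric distributed lag, sketched below. The retention rate of 0.5 per period and the ten-period advertising schedule are hypothetical values chosen for illustration; the studies cited above are not claimed to use this exact form.

```python
def carryover(ad_inputs, retention):
    """Geometric distributed lag: each period's effect equals that period's
    advertising plus a retained fraction of the previous period's effect."""
    effect = 0.0
    path = []
    for advertising in ad_inputs:
        effect = advertising + retention * effect
        path.append(effect)
    return path

# Five periods of constant advertising, then five periods with none.
ads = [1.0] * 5 + [0.0] * 5
path = carryover(ads, retention=0.5)

for period, value in enumerate(path, start=1):
    print(period, round(value, 4))
```

The effect climbs toward a ceiling while the campaign runs and then halves each period after it stops, never quite reaching zero.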
All this suggests that the multiple regression
analysis on lettuce should perhaps be “decomposed”
into a number of equations, rather than
one catch-all relationship. For instance, it appears
possible to construct a series of demand curves by 
quarterly or semi-annual intervals for the years 
1950 through 1960. This would probably provide 
some indication of the gradual demand-shifting ef- 
fect of the C-7 campaign. Equations for the 1950-52 
years could serve as a “before” or control period; 
1953-57 might be used as a “during” period in 
which the build-up of the advertising effect could 
be traced; and the 1958-60 “after” period would 
presumably help to outline the decay curve. 


Media Evaluation 

Marginal returns will have to be equalized not 
only between individual marketing areas but be- 
tween advertising media as well. Many problems 
would be encountered, because data on advertising 
inputs are usually not available in sufficient de- 
tail. The problem can be minimized should adver- 
tising agencies alter slightly their cost accounting 
so as to retain enough detail. 

The analysis shown in Table 1 has produced a
number of interesting results which indicate the 
kind of information which media analysis could 
provide. For instance, although the joint effect of 
all eight of the promotional variables is positive, 
the regression coefficients of some of the variables
are actually negative. For radio spot and network 
advertising this may mean that these media tended 
to discourage lettuce consumption. But since 
neither of the coefficients is significant, such a state- 
ment is not warranted without additional investiga- 
tion. 

However, for fieldmen the regression coefficient 
is negative and significantly so. Here again it can- 
not be concluded that promotion by fieldmen ac- 
tually tended to decrease lettuce consumption. This 
is because the fieldmen worked indirectly by trying 
to induce wholesalers and retailers to cooperate by 
advertising in newspapers (using mats provided by 
Lettuce, Inc.) and in many other ways. Thus the 
regression coefficients for wholesalers, retailers and 
newspapers are positive. This can largely be traced 
back to fieldmen, because when these three varia- 
bles are left out from the equation, the effect of 
fieldmen turns out to be positive. 

In the following section we explain an experi- 
mental method by which such interactions could 
be systematically isolated. 
Systematic Experiments

A mathematical model based on historical data
can of course merely help to explain the past. It
might have only limited predictive value. Yet the
advertiser wants to know, within reasonable limits
of error, what he can expect in the future if he allocates
his funds in certain ways. To provide such
knowledge the mathematical models will have to be
supplemented with results from controlled marketing
experiments. In such experiments the size
and type of advertising investments can be systematically
varied, the influence of differences in location
and time can largely be eliminated, and the
interactions between media can be isolated.

This approach is analogous to laboratory experi- 
ments in which the scientist measures the types of 
changes brought about in the dependent variable 
by systematically controlled changes in the causa- 
tive (independent) variable. It is not necessary to 
test the effect of one independent variable at a 
time; different experimental designs for analysis 
of variance are available, which make it possible to 
simultaneously determine the effect of several varia- 
bles. These designs are being routinely employed in 
physical as well as biological sciences. In fact,
within the last decade some of the complex cross- 
over or rotational latin square designs (Brunk, 
1953; Brunk and Federer, 1953; Wood, 1960) have 
been successfully used in a long series of merchan- 
dising studies carried out under the auspices of the 
U. S. Department of Agriculture (Goldsborough, 
1959; Stockman, 1959; Wells, 1959). The experi- 
ments have primarily focused on measurement of 
the sales impact of alternative merchandising tech- 
niques in the fields of: packaging (type, size, color), 
display (type, size, location), product quality and 
price (Dominick, 1960). Curiously enough, one of
the earliest studies (Smith, 1955) dealt with an eval- 
uation of the impact of four methods of merchan- 
dising lettuce (bulk displays with two-head price, 
bulk displays with one-head price, cellophane bags 
with one- and two-head prices, and banded with 
semimetal tape with one- and two-head prices) on 
sales in 12 retail food stores in Pittsburgh. 

Since the lettuce study, which also included experiments
with apples and tomatoes, work has been
done on pears (Huelskamp, 1957; Smith et al.,
1955), cherries, carrots, bananas (Smith et al., 1956),
peanut butter, creamery butter (Huelskamp, 1957),
dates (Smith and Havas, 1959), canned foods
(Grubbs et al., 1960) and pork (Naumann et al.,
1959).
By and large, the U. S. Department of Agriculture
studies have focused on short-range effects of
specific merchandising or promotional devices. In
contrast, an attempt to compare systematically the
effects of two different advertising themes was conducted
in 1957 by the Washington State Apple
Commission (Henderson, 1959). A rotational latin
square experiment was used in six midwestern
cities. The following three “treatments” were tested
for four-week periods in each city: a general health
theme, a specific apple advertising theme, and no
advertising. In other words, at any particular time
during the experiment all three treatments were
conducted simultaneously in three pairs of the six
cities, with ten or 12 retail stores in each city. Data
on sales and prices of apples and competing fruits
and competitive advertising effort were also obtained.
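
The layout of such an experiment can be sketched as a plain cyclic Latin square: three city pairs, three four-week periods, and each treatment appearing once in every period and once in every city pair. The sketch is an assumption about the general design only; the labels are illustrative, and the actual rotational design of the apple study may have included further refinements not shown here.

```python
treatments = ["general health theme", "apple theme", "no advertising"]
city_pairs = ["city pair 1", "city pair 2", "city pair 3"]  # hypothetical labels

def latin_square(n):
    """Cyclic n x n Latin square: entry (row, col) is (row + col) mod n."""
    return [[(row + col) % n for col in range(n)] for row in range(n)]

square = latin_square(len(treatments))

# Rows are four-week periods, columns are city pairs.
schedule = [[treatments[cell] for cell in row] for row in square]

for period, row in enumerate(schedule, start=1):
    for pair, treatment in zip(city_pairs, row):
        print(f"period {period}, {pair}: {treatment}")
```

Because every treatment meets every city pair and every period exactly once, differences due to location and time can be separated from differences due to the advertising theme.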
An American approaches a Frenchman
near the Seine. “Do you speak English?”
the American asks. “A little,” the Frenchman
replies. “How deep is this river?”
the American asks. “I’m sorry, I don’t
know.” “How fast is the current?” “I don’t
know that either.” “Well, how far is it
across?” “I’m sorry.” “You don’t know
much about this river, do you?” “It has
beauty, and it’s soft and it moves,” the
Frenchman replies. “On gray days, the
river, too, is gray. Yet, when the sun sets,
the water looks golden. This much I know.
This is important to me.”

- Europe, 1960, Look, August 16, 1960, p. 17.

1. It seems that the regression method holds promise as a tool for
quantitative evaluation of the sales producing effect of
advertising. Chances for success appear particularly good for
commodities with certain attributes similar to those of lettuce:
major part of total production is confined to a few rather well
defined areas; per capita consumption trend is upward; and the
advertising campaign is being carried out by one group of sponsors,
without simultaneous confounding effects from other promotions.
Products such as California avocados, clingstone peaches, walnuts,
cantaloupes, turkeys, fresh oranges and lemons, artichokes and dates
would fall within this category.

2. The Stanford Research Institute evaluation was requested only
after the campaign had been going on for several years. By then a
number of statistics were unintentionally “let out of the barn,” and
could not be rounded up again in suitable form. This experience
clearly indicates that for best results the process of evaluation
should be incorporated into the campaign plan. This means that the
data on advertising inputs (i.e., expenditures on individual media)
should be broken down into geographic areas that coincide as closely
as possible with the marketing areas for which sales, population,
income, price and other such data are available. In most cases this
type of matching of input and output data will require practically
no additional bookkeeping. Yet it would increase considerably the
reliability by which statisticians would be able to explain “what
has happened.” If the sales data were matched with qualitative
measurements of, say, attitude, preference, awareness (as was done
in the case of lettuce), the explanatory power of the statistical
analysis could be enhanced still further.

3. Although careful historical analyses will prove extremely useful,
they alone have only a limited predictive value. In order to be able
to say anything about “what most probably will happen,” controlled
marketing experiments will be required.

4. Finally, a word of caution. A careful integration of the
currently available historical quantitative and qualitative
techniques with a systematic program of controlled experimentation
will build up a body of general principles. These would help to
bring the evaluation of advertising further into the realm of
science. This does not mean that advertising will entirely cease to
be an art and suddenly become a matter of mechanics. Far from it.
The element of creativeness and inspiration required in generating
crucial hypotheses is often a matter of unexplainable flashes of
insight, or artistic inspiration, if you please.

In other words, we cannot afford to let ourselves 
be blinded by the promising possibilities of quanti- 
tative measurement. A balanced view of the role of 
advertising obviously requires inclusion of the per- 
haps somewhat more esoteric yet equally crucial 
contributions that are made by the experienced 
practitioners in the behavioral sciences. The psy- 
chologist, the anthropologist and the sociologist 
help in the process of choosing media, structuring 
the form and substance of the message, in building 
up awareness and eventually creating the motiva- 
tion to buy. Without properly reflecting this often 
unquantifiable leg-work that precedes a sale, the 
statistician would be just another panacea peddler. 
He might well end up with little to measure, and 
nothing to enjoy. 



REFERENCES 


ALDERSON, WROE. The Productivity of Advertising Dollars. Cost and
Profit Outlook, Vol. 11, No. 3, February 1958.

AMERICAN MARKETING ASSOCIATION, NORTHERN CALIFORNIA CHAPTER.
Newsletter. American Marketing Association, 1960.

BASEMAN, ROBERT L. Application of Several Econometric Techniques to
a Theory of Demand with Variable Tastes. Unpublished Ph.D.
dissertation. Ames: Iowa State University Library, 1955.

BRENNAN, N. CARSON AND W. D. TOUSSAINT. An Economic Analysis of
North Carolina Lettuce Marketing Practices, Information Series #64.
Raleigh: North Carolina Department of Agricultural Economics,
February 1958, pp. 34-36.

BRUNK, MAX E. Controlled Experiments in Retail Merchandising.
Journal of Farm Economics, December 1953, pp. 916-923.

BRUNK, MAX E. AND WALTER T. FEDERER. Experimental Design and
Probability Sampling in Marketing Research. Journal of the American
Statistical Association, Vol. 48, No. 263, September 1953, pp.
440-452.

DOMINICK, BENNETT A. Research in Retail Merchandising of Farm
Products: Appraisal of Methods and Annotated Bibliography. In
Marketing Research Report #416. Washington, D. C.: U. S. Department
of Agriculture, July 1960.

FOX, KARL A. Demand Expansion and Agricultural Adjustment. Ames,
Iowa: Center for Agricultural Adjustment, Spring 1959, pp. 123-153.

FRIEDMAN, MILTON. A Theory of the Consumption Function. Princeton:
Princeton University Press, 1957.

. . ., GEORGE H. Experience in Designing Research Aimed at the
Evaluation of Agricultural Promotion Programs. In Promotion of
Agricultural Products: Techniques, Evaluation, Theory, Policy.
Conference Proceedings of Committee on Agricultural Marketing
Research of the Western Agricultural Economics Research Council,
April 1959.

GRUBBS, VIOLET et al. Effectiveness of Selected Canned Food Displays
in Supermarkets. In Methodology Supplement to Agricultural Marketing
Reports #371. Washington, D. C.: U. S. Department of Agriculture,
April 1960.

HENDERSON, PETER L. Measuring Effect of Promotional Programs on
Sales of Apples and Other Selected Agricultural Commodities. A talk
presented to the Washington State Apple Commission’s Marketing
Advisory Committee, Wenatchee, Washington, November 10, 1959.

HOOS, SIDNEY. The Advertising and Promotion of Farm Products: Some
Theoretical Issues. In Advancing Marketing Efficiency (Lynn H.
Stockman, Ed.). Chicago: American Marketing Association, 1959, pp.
189-214.

HOPKIN, JOHN A. Commercial Vegetable Production in California. San
Francisco: Bank of America, 1960.

HUELSKAMP, HENRY J. Evaluation of Retail Merchandising Practices for
Pears, Peanut Butter and Creamery Butter. In Agricultural Marketing
Report #180. Washington, D. C.: U. S. Department of Agriculture,
June 1957.

MAINWARING, THOMAS A. An Evaluation of the Effectiveness of the
Advertising and Promotion Program Conducted by Lettuce,
Incorporated, Project No. 1-1800. Palo Alto, Calif.: Stanford
Research Institute, 1956.

MASON, R. HAL. California Asparagus Marketing Study, Project No.
I-2793. Palo Alto, Calif.: Stanford Research Institute, September
1959.

MATHES, SORRELL M. AND G. CLARK THOMPSON. The Art and Science of
Advertising Measurement. Business Record, February 1960, pp. 22-25.

McGLOTHLIN, ROBERT S. Price Relationships in the Western Lettuce
Industry, Bulletin #287. Tucson: Arizona Agricultural Experiment
Station, June 1957.

NAUMANN, H. D. et al. A Large Scale Merchandising Experiment with
Selected Pork Cuts, Research Bulletin #711. Columbia: Missouri
Agricultural Experiment Station, August 1959.

NERLOVE, MARC. The Dynamics of Supply. Baltimore: Johns Hopkins
University Press, 1958.

SMITH, HUGH M. Merchandising Studies in Supermarkets: Apples,
Lettuce, and Tomatoes (A Preliminary Report). Washington, D. C.:
Agricultural Marketing Service, U. S. Department of Agriculture,
March 1955, pp. 5-9.

SMITH, HUGH M. AND NICK HAVAS. Displaying Dates in Packages of
Different Sizes. In Agricultural Marketing Report #349. Washington,
D. C.: U. S. Department of Agriculture, November 1959.

SMITH, HUGH M. et al. Merchandising of Selected Food Items in
Grocery Stores: Canned Red Sour Cherries, Carrots, and Bananas. In
Agricultural Marketing Report #111. Washington, D. C.: U. S.
Department of Agriculture, February 1956.

SMITH, HUGH M. et al. Merchandising Winter Pears in Retail Food
Stores. In Agricultural Marketing Report #102. Washington, D. C.:
U. S. Department of Agriculture, September 1955.

STOCKMAN, LYNN H. Economic Theory and the Promotion of Agricultural
Products. In Promotion of Agricultural Products: Techniques,
Evaluation, Theory, Policy. Conference Proceedings of Committee on
Agricultural Marketing Research of the Western Agricultural
Economics Research Council, April 1959.

VIDALE, M. L. AND H. B. WOLFE. An Operations Research Study of Sales
Response to Advertising. Operations Research, Vol. 5, No. 3, July
1957.

WAUGH, FREDERICK V. Needed Research on the Effectiveness of Farm
Products Promotion. In Advancing Marketing Efficiency (Lynn H.
Stockman, Ed.). Chicago: American Marketing Association, 1959, pp.
189-214.

WELLMAN, HARRY R. The Distribution of Selling Effort Among
Geographic Areas. Journal of Farm Economics, January 1939.

WELLS, O. V. Agricultural Market Promotion: Problems, Scope, and
Policy. In Promotion of Agricultural Products: Techniques,
Evaluation, Theory, Policy. Conference Proceedings of Committee on
Agricultural Marketing Research of the Western Agricultural
Economics Research Council, April 1959.

WOOD RESEARCH CORPORATION. The Measurement of Advertising Effect.
Philadelphia: Wood Chips, May 1960.


How to Estimate Unduplicated Audiences

J.-M. AGOSTINI
Elvinger Advertising Agency

Monsieur Agostini has developed a formula which may render
unnecessary the laborious tabulation of the unduplicated audiences
of combinations of three or more media vehicles. It has been
successfully applied to both French and U. S. magazine data.

WHEN SEVERAL VEHICLES (magazines, newspapers, etc.) are used in an
advertising campaign, it is easy to sum their audiences. But because
of audience duplication, this sum A always exceeds the audience C
actually reached by the combination, and it is precisely this
unduplicated audience C which must be measured in order to evaluate
the coverage of the campaign.

Audience analyses sometimes give data on audience duplication for
vehicles taken two by two; seldom do they give data on audience
duplication for a combination of three or more. But the size of the
unduplicated audience of several vehicles depends not only on the
number of people simultaneously covered by two, but also on the
number simultaneously covered by 3, 4, . . . n vehicles. Analyses
made in the U. S. and in France have shown that readership of three
or more vehicles, particularly magazines, is very common.

Therefore, when more than two vehicles are to be used, as usually
happens, the unduplicated audience is rarely known. However, the
press readership survey made in France by the Centre d’Etude des
Supports de Publicité (CESP) in 1957 may be considered a remarkable
effort in this field. Duplications were obtained for 30 magazines
taken two by two, as well as for all possible combinations of 15 of
the 30 magazines! Three large books were required to present the
results of the 32,767 combinations.

These data are obviously very useful, but refer only to half the
publications covered by the survey. They do not show the
unduplicated audience of any magazine combination which includes a
magazine not among the 15 chosen.

The purpose of this paper is to describe a short- 
cut method of estimating the unduplicated audi- 
ence of a higher order combination of vehicles (trio, 
quartet, etc.) from data on the duplication of these 
vehicles taken two by two. 

When applied to the CESP results, the proposed 
method leads to a satisfactory evaluation of the un- 
duplicated audience provided by any of the 1,073,- 
741,793 combinations possible with the 30 maga- 
zines whose paired duplications are known. 
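
The two counts quoted in this introduction can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function name `count_combinations` is ours, and the reading that the 15-magazine figure counts every non-empty group while the 30-magazine figure counts only groups of two or more is an inference from the numbers themselves, not something the text states.

```python
from math import comb


def count_combinations(n_vehicles, min_size=1):
    """Number of distinct groups of at least `min_size` vehicles out of n."""
    return sum(comb(n_vehicles, k) for k in range(min_size, n_vehicles + 1))


# Groups tabulated in the special study of 15 magazines (assumed: any size >= 1).
special_study = count_combinations(15)

# Groups of two or more magazines that can be formed from all 30 magazines.
all_multi = count_combinations(30, min_size=2)

# Cells of the two by two table that the short-cut method needs as input.
pair_table_cells = comb(30, 2)

print(special_study, all_multi, pair_table_cells)
```

Under these assumptions the first two values agree with the 32,767 and 1,073,741,793 given in the text, while the pairwise table holds only 435 distinct duplications.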


JEAN-MICHEL AGOSTINI is 
marketing consultant and a di- 
rector of the Elvinger Advertising 
Agency in Paris. He received the 
degree of Ingénieur des Arts et 
Manufactures from the Ecole 
Centrale de Paris in 1942. After 
several years as a consultant in
market research, he joined Elvinger
in 1951 as director of research
before assuming his present
position. He is vice president
of the Association Nationale des 
Practiciens en Etudes de Marché, 
member of the board of the Asso- 
ciation pour le Developpement des Techniques d’Etudes de 
Marchés, chairman of the Collége Echanges d’Expériences 
of the Institut de Recherches et d’Etudes Publicitaires 
(IREP) and a member of the technical committee of the 
Centre d’Etude des Supports de Publicité (CESP). IREP 
and CESP perform in France many of the functions under- 
taken by ARF in the United States.

METHOD

Let:
a, b, c, . . . n be the vehicles of the combination,
$A_a, A_b, \ldots, A_n$ be their respective audiences, and
$A$ the sum of these individual audiences. Thus,

(1) $A = A_a + A_b + \cdots + A_n$

Owing to duplication, the unduplicated audience C is smaller than
the sum of the audiences of the vehicles, i.e.,

(2) $C = zA$, where z is a coefficient between 0 and 1.


The proposed method consists in calculating z from the square table
giving the duplication of media taken two by two. This table shows
the audience duplicated by two vehicles at the intersection of their
row and column: $D_{ab}$ is the number of people simultaneously
covered by a and b.

[Table: square array with the vehicles a, b, . . . n heading both
rows and columns; the cell in row a, column b holds $D_{ab}$.]

For obvious reasons this table is symmetrical about its main
diagonal, D being the half-sum of all the terms in the table.

(3) $D = D_{ab} + D_{ac} + \cdots + D_{an} + D_{bc} + \cdots$ etc.


Thus D represents the total of the two by two 
duplicated audiences. In the case of magazines, D 
represents the total of the double readerships. We 
have, on the other hand, the total duplicated 
audiences: 


Suppose we let

(4) $x = \dfrac{D}{A}$

Now x is a factor which can be easily calculated starting from data
already in our hands: the total audience of all vehicles and their
two by two duplications.

We shall establish that there is a relation between x and the factor
z of equation (2).

(5) $z = f(x)$


We shall prove that this relation may be satisfactorily represented
by a unique curve the tracing of which will be given.

Knowing x, the value of z will be found on the curve, and thus
equation (2) will give a fair estimate of the unduplicated audience
C, which is the object of our endeavor.

THE RELATION BETWEEN x AND z

Obviously the greater the duplication between media, the less will
be the unduplicated audience. The higher the value of D, the lower
the value of C. Inasmuch as $x = D/A$ and $z = C/A$, when x
increases z decreases.

When there is no duplication between media, the unduplicated
audience equals the sum of the audiences. In this case, when D = 0,
x = 0, and when C = A, z = 1. Therefore when x = 0, z = 1.

Results released by the CESP enabled us to calculate the values of x
and z for any combination of the 15 magazines for which the special
study on unduplicated audiences was conducted.

To calculate x for a given combination, we divided the duplication
between the magazines (as obtained from the two by two table) by the
sum of audiences of the magazines.

To calculate z, we divided the unduplicated audience C of the same
combination (as actually observed in the special study) by the sum A
of audiences of the magazines.

Theoretically we are in a position to work out 32,767 pairs of
values for x and z. Actually we calculated only 98, selected at
random from all combinations studied.

With x on the abscissa and z on the ordinate, we plotted 98 dots,
the setting out of which clearly shows that the relation between x
and z can be accurately depicted by a continuous curve (see Figure
1).

Such a curve was traced through the 98 dots. The accuracy of the
graphic tool thus obtained was very satisfactory. The difference
between the value of z read from the curve and the true value was in
at least 90 per cent of the cases less than .01 and never more than
.02.

We then tried to translate this curve into an equation. The tracing
of the curve looks like an equilateral hyperbola asymptotic to the x
axis. Obviously when vehicles are added to a combination, the sum A
may increase indefinitely while C must necessarily tend toward a
limit, i.e., z must tend toward zero. The curve z = f(x) must
therefore have the x axis as its asymptote.

Accordingly we tried an equation of this form:

(6) $z = \dfrac{1}{Kx + 1}$

We found that when K = 1.125, equation (6) represented perfectly the
section of the curve traced on the chart up to x = 1.7, the highest
value of x observed in magazine combinations used to obtain the
curve.

[FIGURE 1: OBSERVED RELATIONSHIP BETWEEN UNDUPLICATED (z) AND
PAIR-WISE DUPLICATED (x) COEFFICIENTS OF THE TOTAL AUDIENCE. Figure
not reproduced; z is on the ordinate and x on the abscissa.]


PRACTICAL EXAMPLE

Let us, for example, take four magazines, the unduplicated audience
of which has been calculated in the CESP special study. This will
enable us to judge the accuracy of the proposed graphic method.

Suppose we have to estimate the unduplicated audience of the
following four magazines, knowing the individual audience of each:

                          Readers
                          (in 000’s)
Selection (S)             4,741
Jours de France (JF)      1,573
La Vie Catholique (VC)    2,447
Nous Deux (ND)            4,143

We also know their two by two duplications:

                          S      JF     VC     ND
Selection
Jours de France           638
La Vie Catholique         663    275
Nous Deux                 697    186    283

We calculate

A = 4,741 + 1,573 + 2,447 + 4,143 = 12,904
D = 638 + 663 + 697 + 275 + 186 + 283 = 2,742

hence $x = \dfrac{D}{A} = 0.21$

On the curve we read the z value for x = .21, namely,

z = .805

hence C = zA = .805 (12,904) = 10,400

The true value of C given by the CESP special study was 10,468.
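
The calculation in this example can also be carried out with equation (6) instead of a reading from the curve. The following is a minimal sketch under stated assumptions: the function `estimate_unduplicated` and the short magazine codes are ours, and the assignment of each duplication figure to a particular pair of magazines is inferred from the order of the terms in the sum for D (the total D, and hence the estimate, does not depend on that assignment).

```python
from itertools import combinations

K = 1.125  # constant reported in the text for equation (6)


def estimate_unduplicated(audiences, pair_duplications, k=K):
    """Return (A, D, x, z, C) for one combination of vehicles."""
    vehicles = sorted(audiences)
    total = sum(audiences[v] for v in vehicles)              # A, equation (1)
    duplicated = sum(pair_duplications[frozenset(pair)]
                     for pair in combinations(vehicles, 2))  # D, equation (3)
    x = duplicated / total                                   # equation (4)
    z = 1.0 / (k * x + 1.0)                                  # equation (6)
    return total, duplicated, x, z, z * total                # C = zA, equation (2)


# Readers in thousands, taken from the practical example.
audiences = {"S": 4741, "JF": 1573, "VC": 2447, "ND": 4143}

# Pairwise duplications in thousands; pair labels are an assumption.
pairs = {
    frozenset(("S", "JF")): 638,
    frozenset(("S", "VC")): 663,
    frozenset(("S", "ND")): 697,
    frozenset(("JF", "VC")): 275,
    frozenset(("JF", "ND")): 186,
    frozenset(("VC", "ND")): 283,
}

A, D, x, z, C = estimate_unduplicated(audiences, pairs)
print(A, D, round(x, 4), round(z, 3), round(C))
```

With these inputs the formula gives an estimate of about 10,414 thousand readers, slightly above the 10,400 read from the curve and within one per cent of the 10,468 reported by the special study.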


OTHER APPLICATIONS 

The curve z = f(x) on which the method is based 
was derived empirically from data on 15 French 
magazines. We have found that estimates obtained 
from the curve closely match the results actually
obtained in the CESP special study. But this is no
proof that the curve is valid for other media than 
those which served to build it. We therefore decided 
to test the proposed method beyond those 15 
magazines. 

The CESP survey tabulated the unduplicated 
audience of a few groups of media other than the 
15 chosen magazines. When the two by two dupli- 
cations are given as well, it is possible to apply our 
method and compare the estimate obtained with 
the actual result. We conducted three tests with: 

a. five women’s magazines (of which only one 
was part of the 15); 

b. five general magazines (of which none was 
part of the 15); 

c. eleven Paris dailies. 

In the three cases, the proposed method has led 
us to a fairly close estimate of the unduplicated 
audience: 


      Estimated     Actual
      Audience      Audience      Difference
a.    5,700,000     5,837,000     2.3%
b.    6,200,000     6,210,000     0.2%
c.    6,000,000     6,160,000     2.6%


All media previously mentioned are French pub- 
lications the audiences of which were measured by 
this same CESP survey. It seemed advisable to test 
the proposed method with U. S. magazines. This we 
were able to do thanks to the 1954 Life report, A 
Study of Duplication. In this study, audience dupli- 
cations of five U. S. magazines (Life, Saturday Eve- 
ning Post, Look, Ladies’ Home Journal, This Week) 
were analyzed. 

General observations drawn from particulars are the jewels of
knowledge, comprehending great store in a little room.

Specifically, unduplicated audiences were given for all combinations
of three, four and five magazines. In each case these audiences were
analyzed according to sex and income. We thus have a total of 96
results regarding unduplicated audiences: 16 combinations for each
of six groups (one over-all, two sexes and three income levels). The
same audiences were evaluated by applying our method with the same
constant, K = 1.125. Results:


In 82 cases, the difference between the actual 
audience and the estimated audience was less than 
one per cent. 

In ten cases, this difference lay between one and 
two per cent. 

In four cases, this difference exceeded two per 
cent (maximum 2.7 per cent). 

It appears that the proposed method has a wide 
field of application. 


CONCLUSIONS 

If it is true that the only information required to estimate an
unduplicated audience is the total audience and two by two vehicle
duplications, then great economy of effort is possible. Instead of
undertaking the fantastic amount of tabulation required to calculate
the unduplicated audiences of all possible combinations of a
restricted number of vehicles, it is preferable to set up, for all
vehicles, a series of square tables of their two by two
duplications, each table containing these duplications within one
stratum. It should be noted that to evaluate the unduplicated
audience of a combination of vehicles within a stratum (e.g., an age
group), one must know the duplications within this stratum.
Knowledge of duplications for the whole sample is not sufficient.

We feel that an analysis of the two by two duplications by strata
for any individual audience analysis is much more efficient than a
repetition of such voluminous work as the CESP special study. We
must not forget, however, that if this tremendous enterprise had not
been undertaken, we would not have found a method to save us from
doing it in
the future.


A Switch-Over Experimental Design
to Measure Advertising Effect

R. J. JESSEN
University of California at Los Angeles
and C-E-I-R, Inc.

The influence of advertising may endure long after its placement.
Dr. Jessen describes an experiment which takes account of such
delayed effects while measuring sales response to current
advertising outlay.

. . . lecturer in business statistics at UCLA and project director
at C-E-I-R, Inc. in Los Angeles. He . . . 1943. Until 1947, he was
an agricultural statistician at the U. S. Bureau of Agricultural
Economics. For the next ten years he taught statistics at Iowa State
University while consulting on survey techniques with the
Rockefeller Foundation, United Nations, Tennessee Valley Authority
and U. S. Departments of Agriculture and the . . . emphasized
gauging the cost and effectiveness of sampling techniques. He is a
fellow of the American Statistical Association and a member of the
American Marketing Association, Biometric Society and Inter-American
Statistical Institute. He also serves on the A.S.A.’s three-man
Technical Committee on Broadcast Ratings.

AN IMPORTANT GOAL in determining the size of an advertising budget
is to maximize profits. To achieve this goal, even approximately, is
a very complex problem. Many procedures or rules have been adopted
to provide what appears to be a reasonable answer from one point of
view or another, but which, most people in the business would agree,
are very crude at best. Hence, such approaches as
“percentage-of-sales,” “all-you-can-afford,” “objective-and-task”
and “competitive-parity” are used because “scientific” approaches
either don’t exist, or have not been able to prove their
superiority.

The difficulties of devising a scientific approach are fairly well
known. The multitude of factors involved and the scarcity of
relevant and accurate data are only two of many which could be
listed. But work is being done to develop good methods and, with the
increasing interest that researchers are paying to the problem,
there is no doubt that real progress is being and will be made.

The purpose of this paper is to present a scheme for determining the
sales response to varying advertising outlays for a very simple
case. It is hoped that the method can be extended to more complex
and therefore more realistic cases once the underlying principles
are thoroughly understood.

Although we are concerned here with the response of sales to
different amounts of advertising outlay, it should be relatively
straightforward, at least in principle, to proceed to the
determination of the optimum outlay.

Suppose we represent the general relationships existing between the
advertising outlay of a firm, its consequent sales and profits as
those shown in Figure 1. To keep our model simple, let us assume
that the “best” use is made of the advertising budget at each outlay
level and that disturbances such as seasonal variations, general
state of the market and the economy, competitors’ activities, etc.,
are held constant.

[Figure 1: GENERAL RELATIONSHIP OF REVENUE AND PROFITS TO
ADVERTISING OUTLAYS. Figure not reproduced; horizontal axis:
Advertising Outlay ($ millions).]

Sales are expected to increase, but not in proportion to advertising
expenditures. Increasingly heavy doses of advertising are expected
sooner or later to meet with a diminishing increase in sales. If
gross profits are defined as simply Sales Revenue less all costs
except those for advertising, we obtain the Gross Profit curve
(which may be negative for low advertising outlays). Next, a curve
is shown representing increasing Advertising Costs, and lastly, the
Net Profit curve which is simply the Gross Profit less Advertising
Costs. Net Profit is a maximum at Point “P” where $2.75 million net
profit is made with an advertising budget of $4.25 million. Any
other size of budget will yield less profit.
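
The logic of locating the maximum of the Net Profit curve can be illustrated numerically. The sketch below is not a reconstruction of Figure 1: the exponential response curve and the constants `CEILING`, `SCALE` and `FIXED_COSTS` are hypothetical choices of ours, picked only so that gross profit shows diminishing returns and is negative at very low outlays, as the text describes.

```python
import math

# Hypothetical response curve, in $ millions (NOT the values behind Figure 1).
CEILING = 10.0      # assumed upper limit of the advertising-driven margin
SCALE = 2.0         # assumed outlay scale controlling diminishing returns
FIXED_COSTS = 1.0   # assumed costs that make gross profit negative near zero


def gross_profit(outlay):
    """Sales revenue less all costs except advertising (hypothetical form)."""
    return CEILING * (1.0 - math.exp(-outlay / SCALE)) - FIXED_COSTS


def net_profit(outlay):
    """Net profit is gross profit less the advertising outlay itself."""
    return gross_profit(outlay) - outlay


def best_outlay(step=0.001, upper=10.0):
    """Grid search for the outlay giving the largest net profit."""
    n_steps = int(round(upper / step))
    candidates = [i * step for i in range(n_steps + 1)]
    return max(candidates, key=net_profit)


optimum = best_outlay()
print(round(optimum, 2), round(net_profit(optimum), 2))
```

For this assumed curve the maximum falls where one more dollar of advertising adds exactly one dollar of gross profit; any smaller or larger budget yields less net profit, which is the property the text attributes to Point “P.”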

This paper is concerned with a procedure for determining the top
curve, the response of sales to total advertising outlays. The use
of the budget among the different media and the particular kind of
copy presented need not be fixed. The condition which must be
satisfied will be mentioned later. The remainder of the paper will
present:


1. a brief statement on basic statistical requirements for planning
and conducting experiments;

2. special problems in dealing with advertising budgets;

3. a layout for a switch-over type design;

4. some methods of estimating “carry-over” effects;

5. the results of a hypothetical experiment which estimate these
effects;

6. discussion of the applicability of this experimental design.


METHOD 
Experiments 

A controlled experiment is an investigation in which the
experimenter chooses which treatment to apply when and where, and
then observes the response. The design and analysis of complex
experiments is a branch of applied mathematics largely originating
in the biological sciences and in recent years is gradually
appearing in a number of other sciences. Its penetration into
economics and studies of human behavior has been particularly slow,
partly because of the complexity of many of the problems of these
fields, and partly because of unfamiliarity with modern developments
in experimental techniques.

It was a long-held scientific doctrine that good experimental
procedure was to study one variable at a time holding all others
constant. Thus, one might study the relationship of the occurrence
of stomach ulcers in monkeys to kinds, intensity and frequency of
stress by varying the intensity while holding kind and frequency
constant. Having determined this relationship, one would vary
frequency while holding kind and intensity constant, and so on. Such
a procedure is not only quite wasteful of time and resources, but
provides insufficient information. If there is an interaction
between intensity and frequency, it would never be discovered by
this method. The correct method is to determine the effects of
intensity, frequency and kind, and their interactions simultaneously
in a single well-planned experiment.
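
A small numerical illustration may make the point about interactions concrete. The responses below are invented for the purpose (they are not data from any study), and the function names are ours; the sketch simply contrasts what a one-variable-at-a-time study would predict for the untested combination with what a full factorial layout would actually observe.

```python
# Hypothetical mean responses (arbitrary units) for a 2 x 2 factorial in
# which intensity and frequency are each set "low" or "high".
response = {
    ("low", "low"): 10.0,    # (intensity, frequency)
    ("high", "low"): 14.0,
    ("low", "high"): 12.0,
    ("high", "high"): 24.0,  # the cell a one-at-a-time study never runs
}


def one_at_a_time_prediction(r):
    """Predict the high/high cell by adding the two single-factor gains."""
    base = r[("low", "low")]
    intensity_gain = r[("high", "low")] - base
    frequency_gain = r[("low", "high")] - base
    return base + intensity_gain + frequency_gain


def interaction(r):
    """Half the difference between the intensity effect at high and at low frequency."""
    effect_at_high = r[("high", "high")] - r[("low", "high")]
    effect_at_low = r[("high", "low")] - r[("low", "low")]
    return (effect_at_high - effect_at_low) / 2.0


predicted = one_at_a_time_prediction(response)
print(predicted, response[("high", "high")], interaction(response))
```

With these invented figures the one-at-a-time prediction for the high/high cell is 16 while the factorial observes 24; the gap is the interaction, which only the experiment that varies both factors together can measure.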

To design experiments appropriate for a given investigation requires
some familiarity with experimental design as an art and science, and
some of the relevant lore of the field in which it is being applied.
If the problem is such that standard techniques, even with
modification, will not apply, it may be advisable to devise new
procedures.

Advertising Budgets

There are a number of special problems that im- 
mediately become apparent when one considers 
good experimental procedure for measuring the ef- 
fects of advertising budgets. Often these problems 
have made some would-be experimenters shy away 
with the remark that experimentation may be all 
right for agriculture in testing for yield effects from 
different fertilizer treatments or in engineering re- 
search, but it just can’t be used in the social 
sciences. 

Listed below are some of these problems, with 
comments: 


1. What is a “dose” of advertising expenditure? The amount of dose
can be varied by changes in frequency, coverage, space units (e.g.,
color versus black and white), copy, etc.

2. Carry-over effects. After an advertising treatment is re- 
moved, there is still a residual effect left which will continue 
to influence sales for awhile. 

3. Seasonal effects. Sales may have a pronounced seasonal 
pattern and it may be necessary to remove this influence. 
Also, a given dose of advertising given in the spring may 
bring a response different from the same dose given in the 
fall. 

4. Geographical effects. Responses to advertising differ 
from one locality to another due to income structure, influ- 
ence of competitors’ position, etc. 

5. Competitors’ programs. If a competitor changes his ad- 
vertising during the course of an experiment it may confound 
results. 

6. Shifts in consumers’ behavior. If the experiment takes a 
long time to complete, basic consumer behavior in regard to 
advertising may change. 

7. Other factors affecting response. Changes in weather, the world
political and economic situation, etc., during an experiment may
cause trouble.

8. Measuring sales. Records on sales may not be available 
for experimental markets or for the time periods desired. 
Perhaps there have been price changes in the product, or a 
new premium scheme has been launched. 

9. Interference of the experiment with going operations. 


A properly designed experiment would have to 
take problems of the sort listed above into account 
or else it is doomed to failure. 


A Simple Switch-Over Design 

Consider the following illustrative problem. A company has sales outlets in 85 market areas into which it has divided the U.S. (a number chosen to keep the arithmetic here simple), and its advertising is carried out locally in each of its market areas. Suppose its advertising budget in each market area is now $1,000 per month. What would be the sales response to budgets of $2,000 per month? To $3,000 per month? These outlays would correspond roughly to annual expenditures of one, two and three millions (see Figure 1).

Denote test areas by 1, 2, 3, etc., time periods by I, II, III and budget levels by A, B, C, where the lower budget rate (A) is $1,000, the medium (B) is $2,000 and the high (C) is $3,000.
We wish to put the treatments A, B and C on each area during the experimental period in order to eliminate area differences in our comparisons. It is reasonable to believe that the order of treatments will affect results; that is, the sales response in each period to an advertising sequence of ABC will likely be different from that of CBA. (This would mean a $1,000-$2,000-$3,000 sequence as contrasted with a $3,000-$2,000-$1,000 sequence.) Since there is a total of six possible sequences, a minimum of six areas will be required for the experiment. These can be arranged into two groups of three areas each, the areas having similar seasonal patterns being put together into the same group. With this design, each group can be regarded as a separate experiment known as a 3 x 3 Latin square. There will be two basic types to cover all possible sequences (see Figure 2).

Figure 2
3 x 3 LATIN SQUARE AND ITS COMPLEMENT

                    Area
                1    2    3

           I    A    B    C
Period    II    B    C    A
         III    C    A    B

                    Area
                4    5    6

           I    A    B    C
Period    II    C    A    B
         III    B    C    A
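
The construction of Figure 2 can be checked with a short sketch. It enumerates the six possible orders of A, B and C and confirms that the cyclic square (areas 1 to 3) and its complement (areas 4 to 6) each place every treatment once in every period. The function and variable names are illustrative only and do not come from the paper.

```python
from itertools import permutations

TREATMENTS = "ABC"

# All six possible treatment sequences over three periods.
sequences = ["".join(p) for p in permutations(TREATMENTS)]

def is_latin_square(area_sequences):
    """True if every period (row) contains each treatment exactly once."""
    return all(
        sorted(seq[period] for seq in area_sequences) == list(TREATMENTS)
        for period in range(len(TREATMENTS))
    )

# Areas 1-3 use the cyclic square; areas 4-6 take the remaining sequences.
square = ["ABC", "BCA", "CAB"]
complement = [s for s in sequences if s not in square]

print(square, is_latin_square(square))
print(complement, is_latin_square(complement))
```

Read column by column, the two lists reproduce the area sequences of Figure 2.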
For our purposes it may be better to combine the two squares into a single experiment with rows (periods) regarded as constant throughout the U.S. In this case the breakdown of degrees of freedom for the purpose of determining “experimental error” would be as follows:


Source of Variation      Degrees of Freedom
Treatments                        2
Periods                           2
Sequences                         5
Residuals                         2
Error                             6

Total                            17

In a later section, suggestions are given for the ap- 
propriate method of computing the mean squares 
of this table for this design. 

Carry-Over Effects 

The switch-over type design can be useful in 
eliminating errors due to area differences, both in 
the average main effects of the budget treatments 
and in the average seasonal effects, by balancing 
one effect against another. If carry-over exists, the 
unadjusted observed effects will be biased. Adjust- 
ments will be required if estimates free of bias are 
to be made of what the sales effect for various 
budgets would be if they were carried out for ex- 
tended periods of time. 

The nature of carry-over effects in the proposed 
experiment may be illustrated by extending our ex- 
ample. Suppose our company, whose advertising 
sales-profit functions are shown in Figure 1, carried 
out an advertising budget experiment on a set of 
six test market areas chosen by some randomized 
scheme. A switch-over type design was used where 
three levels of budgets were run in the six possible 
sequences shown in Figure 2. A response to a given 
budget level is read off the response function in 
Figure 1. For example, if the budget in a given market area is $1,000 per month, it would correspond to a national annual level of $1,000,000 and would produce an expected response of $1,000,000 in sales at the national annual level, or $1,000 per month in the individual market area. Likewise, budgets of $2,000 and $3,000 per month would produce expected sales responses of $3,400 and $6,300 per month, respectively.

In Figure 3, the response to each budget is shown for two cases: (1) with and (2) without response lag or carry-over. The broken lines indicate the response for each period if there were no lag; the small circles (connected by solid lines) indicate the average response for the period, considering that some sales may lag somewhat behind the input of advertising effort. It is assumed that experimental periods for a trial run at a given level are so short that some sales triggered by advertising in one period occur during the subsequent period, when a new level is being put to test. It is presumed that the advantages of a short period are great enough to make it worthwhile to complicate the statistical task that this design entails.

The general effect of lags is to obscure the underlying relationships. For example, in box 3.1 of Figure 3, the test area is presumed to have been run at the $1,000 budget level (A) for the period before the experiment is regarded as underway. The sales response in period I, in both the no-lag and the lag-response case, is the same: $1,000 per month. In period II, instant responses would provide sales of $3,400 per month, but with lag, the observed response is $2,680. Likewise, in period III the values are $6,300 and $5,430, respectively. In box 3.2 note that lag reduces the sales in period II, but increases them in period III when the budget was reduced from the high to a medium level.

To look at the nature of response of sales to an advertising stimulus, let us consider a very simple model. Suppose we consider what happens if we put a small “dose” of advertising into a market. Following the advertising, we might expect that sales would rise to some peak and then trail off. We might represent this relationship by the simple graph at the top of Figure 4, where a dose of advertising was applied in the middle of period I and sales increased from zero to a maximum by the end of the period, two-fourths of a period later, then trailed off to zero again in another two-fourths of a period. The total sales resulting from this dose of advertising is represented by the area under the curve (or the sum of the columns). This is the total response. However, only half the sales occurred in the period when the advertising was carried out. A “residual” of sales carried over to the subsequent period.

If we consider a series of doses during a given period and let each respond in this way independently of the others, we have a picture like that at the bottom of Figure 4. Advertising outlays made during period I will build up to a maximum (in this case by the end of the period), and if the outlays are continued through period II, no further gain is made. Hence, in period II, a kind of equilibrium is reached where sales are at a maximum for this level of advertising outlay. If the advertising is withdrawn in period III, the sales will trail off to zero again. Later, we shall denote the sales in the first period as D, the direct response; those in the second period as T, the total response; and those in the third period as R, the residual response.
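
The dose model can be illustrated with a small numerical sketch. It assumes, purely for illustration, that each period is divided into four quarters, that one dose is applied per quarter during periods I and II, and that each dose yields sales of 1, 2, 2 and 1 units over the four quarters beginning with the quarter of the dose. This kernel is a hypothetical stand-in for the rise and fall drawn in Figure 4, not a set of values taken from the paper.

```python
# Hypothetical response of one dose over four successive quarters
# (an assumption made for illustration, not data from the paper).
KERNEL = [1, 2, 2, 1]
QUARTERS_PER_PERIOD = 4

def sales_by_quarter(dose_quarters, horizon):
    """Add up the independent responses of every dose, quarter by quarter."""
    sales = [0] * horizon
    for start in dose_quarters:
        for lag, amount in enumerate(KERNEL):
            if start + lag < horizon:
                sales[start + lag] += amount
    return sales

# Doses in every quarter of periods I and II, none in period III.
sales = sales_by_quarter(range(2 * QUARTERS_PER_PERIOD), 3 * QUARTERS_PER_PERIOD)
D, T, R = (
    sum(sales[p * QUARTERS_PER_PERIOD:(p + 1) * QUARTERS_PER_PERIOD])
    for p in range(3)
)

print(sales)    # build-up, equilibrium, trail-off
print(D, T, R)  # direct, total and residual responses
```

Under these assumptions the direct and residual responses add up to the total response, which is the relation the model described in the text relies on.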
Figure 4
SIMPLE MODEL OF SALES RESPONSE TO ADVERTISING IN DOSES
(Sales plotted against time over Periods I, II and III)

RESULTS 


To provide illustrative material for Figure 3, a model was developed to represent the lag or carry-over of some advertising. Suppose we represent the amount of sales response observed in a given market, during a given period and at a given budget level, as a sum of a number of component effects:

Y_{ijk} = D_k + R_{k'} + a_i + p_j + e_{ijk}

where,

Y_{ijk} = the observed response in the i-th market area during the j-th period when treatment k is being applied following treatment k',

D_k = the actual response to treatment k during its period of application, or “direct” response,

R_{k'} = the actual response to treatment k', applied in the preceding period, occurring during the present period, or “residual” response,

a_i = the differential effect of the i-th market area,

p_j = the differential effect of the j-th period, and

e_{ijk} = all other effects, to be regarded as residual “error.”


Since we are interested in the total sales effect of a given advertising application, we need to determine both D and R for a given treatment; that is, how much we get currently and how much subsequently. If we denote total response of treatment k by T_k, then

T_k = D_k + R_k

The switch-over type of design, with modifications, will permit unbiased estimates to be made of the total responses, T_k. As an illustration of the kind of data required and the computations involved, let us continue with the example used so far. For the true values of T_k, we have for treatment levels A, B and C:

T_A = 1.0 (thousands of dollars per month)
T_B = 3.4 (thousands of dollars per month)
T_C = 6.3 (thousands of dollars per month)

Now, let us assume that direct and residual portions of the total responses to advertising are:

D_k = w T_k
R_k = (1 - w) T_k

where w is simply the fraction of the total response occurring during the period of the treatment, and (1 - w) is the remaining fraction occurring in the subsequent period. If we assume for our example that w = .70 then,

Treatment k    T_k    D_k     R_k
A              1.0    0.70    0.30
B              3.4    2.38    1.02
C              6.3    4.41    1.89

The observed responses for each of the six sequences of the three treatments are simply

Y_k = D_k + R_{k'}

where R_{k'} is the residual from the previous treatment. The Y_k's for our example have been determined and plotted in Figure 3.
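
As a sketch of the arithmetic behind Figure 3, the following splits each total response into its direct and residual parts with w = .70 and then adds the residual of the preceding treatment to the direct response of the current one. It assumes, as the text later states for this design, that the pre-experiment treatment is the same as the first-period treatment. The function name is illustrative.

```python
TOTAL = {"A": 1.0, "B": 3.4, "C": 6.3}  # thousands of dollars per month
W = 0.70                                # fraction of response in the treatment period

DIRECT = {k: W * t for k, t in TOTAL.items()}
RESIDUAL = {k: (1 - W) * t for k, t in TOTAL.items()}

def lagged_responses(sequence):
    """Observed response per period, ignoring area and period effects."""
    previous = sequence[0]  # pre-experiment treatment equals the first treatment
    observed = []
    for current in sequence:
        observed.append(round(DIRECT[current] + RESIDUAL[previous], 2))
        previous = current
    return observed

print(lagged_responses("ABC"))  # the sequence discussed for box 3.1
```

For the sequence ABC this gives 1.00, 2.68 and 5.43, the lagged values quoted for box 3.1, against no-lag values of 1.0, 3.4 and 6.3.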
Let us assume that the period effects are:

p_1 = +0.1 (thousands of dollars per month)
p_2 =  0.0 (thousands of dollars per month)
p_3 = -0.1 (thousands of dollars per month)

This means that during period I the sales response for $1,000 per month of advertising outlay will be $100 per month more than the average period; whereas, in period III, it will be $100 per month less.

For market area effects we have (in thousands of dollars per month)

a_1 = -0.3    a_4 = +0.1
a_2 = -0.2    a_5 = +0.2
a_3 = -0.1    a_6 = +0.3

That is, market area 1 during any period always responds with $300 less sales per month for $1,000 advertising outlay than the average market area; whereas market area 2 is $200 less, etc.

Assuming residual errors are zero and aggregating the effects from the several sources, we have in Table 1 the values which would be observed in our switch-over experiment with the six test market areas. (Figure 3 is based on these data, except that area and period effects were omitted.)


TABLE 1
“OBSERVED” SALES IN THE SWITCH-OVER EXPERIMENT

                        Market Area
Period         1           2           3        Total
  1        A = 0.80    B = 3.30    C = 6.30     10.40
  2        B = 2.38    C = 5.23    A = 2.49     10.10
  3        C = 5.03    A = 2.29    B = 2.48      9.80
Total          8.21       10.82       11.27     30.30

Period         4           5           6        Total
  1        A = 1.20    B = 3.70    C = 6.70     11.60
  2        C = 4.81    A = 1.92    B = 4.57     11.30
  3        B = 4.27    C = 4.81    A = 1.92     11.00
Total         10.28       10.43       13.19     33.90
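
The entries of Table 1 can be regenerated from the stated components. The sketch below works in hundredths of a thousand dollars so that the arithmetic stays exact, and it assumes that the pre-experiment treatment in each area equals its first-period treatment. All names are illustrative.

```python
# Values in hundredths of a thousand dollars per month (100 means $1,000).
TOTAL = {"A": 100, "B": 340, "C": 630}
DIRECT = {k: 70 * t // 100 for k, t in TOTAL.items()}   # w = .70
RESIDUAL = {k: TOTAL[k] - DIRECT[k] for k in TOTAL}

SEQUENCE = {1: "ABC", 2: "BCA", 3: "CAB", 4: "ACB", 5: "BAC", 6: "CBA"}
AREA_EFFECT = {1: -30, 2: -20, 3: -10, 4: 10, 5: 20, 6: 30}
PERIOD_EFFECT = {1: 10, 2: 0, 3: -10}

def observed_sales():
    """Return {(area, period): sales} with residual error set to zero."""
    table = {}
    for area, sequence in SEQUENCE.items():
        previous = sequence[0]  # pre-experiment treatment
        for period, current in enumerate(sequence, start=1):
            table[(area, period)] = (
                DIRECT[current] + RESIDUAL[previous]
                + AREA_EFFECT[area] + PERIOD_EFFECT[period]
            )
            previous = current
    return table

table = observed_sales()
for period in (1, 2, 3):
    print([table[(area, period)] / 100 for area in range(1, 7)])
```

Each printed row lists areas 1 to 6 for one period and should match the corresponding rows of the two halves of Table 1.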


It may be of interest to see if the basic response data fed 
into the system can, in fact, be recovered. To do this, calcu- 
lations of the following quantities are required: 


H = total responses for all period-areas on which treatment k was put. This is the unadjusted total for each treatment.

F = total responses for all period-areas immediately following the one on which treatment k was put (including the pre-experiment treatment).

S_L = total of sequences where treatment k occurred last.

S_F = total of sequences where treatment k occurred first.

G = grand total of the experiment.

\bar{y} = grand mean of the experiment.


The computed values of these quantities are shown in Table 2. An estimate of T_k, the total sales attributable to the k-th advertising budget, both direct and residual, is given by the estimator,¹

\hat{T}_k = H/6 + (1/12)(3F - G + S_L - S_F)

For treatment A, the budget rate of $1,000 per month, we obtain as an estimate of total sales,

\hat{T}_A = 10.62/6 + (1/12)[(3)(16.48) - 64.20 + 24.01 - 18.49]
          = 10.62/6 - 9.24/12
          = 1.00

Hence, we have “recovered” the correct value of treatment A from our dummy experiment. In a similar manner the correct responses for treatments B and C can be estimated.

¹ The author is indebted to Dr. M. R. Mickey for this formula and for the appropriate method for computing “experimental error” for this design.
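
The recovery of all three totals can be sketched as follows. The data are the Table 1 values in hundredths of a thousand dollars, and the quantities H, F, S_L, S_F and G are accumulated as defined in the text, again taking the pre-experiment treatment to equal the first-period treatment. Exact fractions are used to avoid rounding; the function name is illustrative.

```python
from fractions import Fraction

SEQUENCE = {1: "ABC", 2: "BCA", 3: "CAB", 4: "ACB", 5: "BAC", 6: "CBA"}
# Table 1, one list per area (periods 1-3), in hundredths of $1,000 per month.
SALES = {
    1: [80, 238, 503], 2: [330, 523, 229], 3: [630, 249, 248],
    4: [120, 481, 427], 5: [370, 192, 481], 6: [670, 457, 192],
}

def estimate_totals():
    """Apply T_k = H/6 + (3F - G + S_L - S_F)/12 to each treatment."""
    G = sum(sum(values) for values in SALES.values())
    estimates = {}
    for k in "ABC":
        H = F = S_L = S_F = 0
        for area, sequence in SEQUENCE.items():
            previous = sequence[0]  # pre-experiment treatment
            for period, current in enumerate(sequence):
                if current == k:
                    H += SALES[area][period]
                if previous == k:
                    F += SALES[area][period]
                previous = current
            if sequence[-1] == k:
                S_L += sum(SALES[area])
            if sequence[0] == k:
                S_F += sum(SALES[area])
        estimates[k] = Fraction(H, 6) + Fraction(3 * F - G + S_L - S_F, 12)
    return estimates

for k, value in estimate_totals().items():
    print(k, float(value) / 100)
```

With zero residual error the three estimates come out at the true totals of 1.0, 3.4 and 6.3 thousand dollars per month.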

If we define the differential direct and residual effects in the system as

d_k = D_k - \bar{D} and r_k = R_k - \bar{R}

respectively, where \bar{D} and \bar{R} are the means of the component effects, then the true values for these quantities in this example are:

Treatment     d_k      r_k
A            -1.80    -0.77
B            -0.12    -0.05
C            +1.92    +0.82

and \bar{D} = 2.50, \bar{R} = 1.07 and \bar{T} = 3.57. To estimate r_k we have

\hat{r}_k = (1/12)[3F - G + S_L - S_F]

and

\hat{r}_A = (1/12)[(3)(16.48) - 64.20 + 24.01 - 18.49] = -0.77

To estimate d_k we have

\hat{T}_k = \bar{T} + \hat{d}_k + \hat{r}_k

hence \hat{d}_k = \hat{T}_k - \bar{T} - \hat{r}_k

and \hat{d}_A = 1.00 - 3.57 + 0.77 = -1.80

where \bar{T} is estimated by \bar{y}, the grand mean of the experiment. In general, we cannot estimate D and R individually with this design, but it is not of importance here anyway.

It should be noted that certain properties are peculiar to this design. The treatments are applied during the pre-experimental period so that observations made during the first period include residuals of known type. This also permits a simple method of estimating what is important here, the total responses T_k, as well as the components, d_k and r_k. An estimate of w is also available from the experiment itself.

The variance of \hat{T}_k is given by

Var(\hat{T}_k) = \sigma^2 / 3

where \sigma^2 is the appropriate experimental error for this design and

Covar(\hat{T}_A, \hat{T}_B) = -\sigma^2 / 12
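
These two factors can be checked numerically. The sketch below writes the estimator as a weighted sum of the 18 area-period observations and, under the added assumption that the errors are independent with a common variance \sigma^2, sums the squared weights and the cross-products of weights. The independence assumption and all names are this sketch's, not the paper's.

```python
from fractions import Fraction

SEQUENCES = ["ABC", "BCA", "CAB", "ACB", "BAC", "CBA"]  # areas 1-6

def weights(k):
    """Weight of each area-period cell in H/6 + (3F - G + S_L - S_F)/12."""
    w = {}
    for area, sequence in enumerate(SEQUENCES, start=1):
        previous = sequence[0]  # pre-experiment treatment
        for period, current in enumerate(sequence, start=1):
            c = Fraction(-1, 12)          # -G/12 applies to every cell
            if current == k:
                c += Fraction(1, 6)       # H/6
            if previous == k:
                c += Fraction(3, 12)      # 3F/12
            if sequence[-1] == k:
                c += Fraction(1, 12)      # +S_L/12
            if sequence[0] == k:
                c -= Fraction(1, 12)      # -S_F/12
            w[(area, period)] = c
            previous = current
    return w

w_a, w_b = weights("A"), weights("B")
variance_factor = sum(c * c for c in w_a.values())
covariance_factor = sum(w_a[cell] * w_b[cell] for cell in w_a)

print(variance_factor, covariance_factor)
```

Multiplying each factor by \sigma^2 gives the variance and covariance of the estimated totals under the stated assumption.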


TABLE 2
COMPUTATION SUMMARY OF TOTAL, DIRECT AND RESIDUAL EFFECTS

Treatment     H       F      S_L     S_F     T_k     d_k     r_k
A           10.62   16.48   24.01   18.49    1.00   -1.80   -0.77
B           20.70   21.10   21.55   21.25    3.40   -0.12   -0.05
C           32.88   26.62   18.64   24.46    6.30   +1.92   +0.82
Totals      64.20   64.20   64.20   64.20   10.70    0.00    0.00

G = 64.20
\bar{y} = 64.20/18 = 3.567
The appropriate method of computing the sums of squares for the analysis of variance table given in the previous section is conventional for all sources except residuals. Here the sum of squares is

4 (\hat{r}_A^2 + \hat{r}_B^2 + \hat{r}_C^2)


The rather serious bias which can arise if the data are not treated properly, and which obscures the true relationship of sales to outlay, may be examined in this case. The unadjusted treatment totals are given by the H's in Table 2. From these we can compute the unadjusted treatment means. Thus, for the biased estimates of T_k we have,

T_A' = 10.62/6 = 1.77
T_B' = 20.70/6 = 3.45
T_C' = 32.88/6 = 5.48

FIGURE 5
DISTORTION FROM CARRY-OVER EFFECTS
(Horizontal axis: Advertising Outlay, by Treatment Groups)


When plotted against the true values (see Figure 5) the nature of the bias becomes apparent. In this case the responses to low levels of outlay are overestimated and those to high levels are underestimated. If guided by such results one may miss the true optimum level by a wide margin.
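
The size of the distortion can be tabulated directly from the unadjusted totals in Table 2. This is a minimal sketch of that comparison with illustrative names.

```python
H = {"A": 10.62, "B": 20.70, "C": 32.88}   # unadjusted treatment totals
TRUE_TOTAL = {"A": 1.0, "B": 3.4, "C": 6.3}
CELLS_PER_TREATMENT = 6

unadjusted_mean = {k: round(h / CELLS_PER_TREATMENT, 2) for k, h in H.items()}
bias = {k: round(unadjusted_mean[k] - TRUE_TOTAL[k], 2) for k in H}

for k in "ABC":
    print(k, unadjusted_mean[k], TRUE_TOTAL[k], bias[k])
```

The biases of +0.77, +0.05 and -0.82 are the negatives of the r_k values in Table 2, which is why adding the residual adjustment to H/6 restores the true totals.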


DISCUSSION 


The several steps still remaining will only be 
briefly mentioned here and perhaps dealt with in 
some detail in a later paper. The next step in this 
plan is to compute the Net Profit curve from the 
estimates of the portion of the Gross Sales curve 
covered by the experimentation (Figure 4), and 
from estimates of production costs. An examination 
of the information should be helpful to determine 
where the advertising budget should be set for all 
non-experimental market areas, and what levels 
should be chosen for further experimentation. 

Note in Figure 6 that our illustrative experiment explored only a small portion of the Net Profit curve. Future experimentation would test higher levels of advertising outlay in order to seek the maximum at M. (Bear in mind that the experimenter would not know the portion of the curve shown by a broken line.) He may be quite cautious, thinking that level C is near maximum already. The strategy to use at this point could be an interesting study for the future.

It is foolish to think that these relationships are fixed and, once determined, will form the basis for all future action. Rather, it is more reasonable to regard the relationships as undergoing constant change, so that a business organization is faced with the problem of constantly seeking out the relationships currently in effect in order to behave most rationally in regard to them. This requires that it keep up constant experimentation to determine where these curves are and perhaps, some day, to forecast them.

Some businesses will immediately see that a plan 
such as this one requires that sales be known by 
both market area and period. This will be a serious 
limitation unless measures are taken to obtain such 
information either from commercial services or by 
organizing their own. The advisability of either 
depends on expected gains against costs. 


FIGURE 6
THE NET PROFIT CURVE
(Horizontal axis: Advertising Outlay)


The basic ideas herein can be extended in many 
ways to deal with specific problems. For example, in 
a manner similar to that presented, an experimental 
plan could be made which would evaluate the 
effectiveness of different media rather than budget 
levels, or a combination of both. A more elaborate 
design could evaluate both media and budget levels. 
It appears that the response curves for different 
levels of inputs in the several media may differ 
somewhat, but what is probably more important 
are the interactions of media. Certain media mixes 
may have a total effect different from what the sum 
of the component parts might indicate. These 
differences can also be isolated and measured by 
appropriately designed experiments. 
The Influence of Newspaper Tune-In Advertising on the Size of a TV Show’s Audience


JAMES C. BECKNELL, JR.
E. I. du Pont de Nemours & Company


Twenty-one cities were randomly assigned to groups receiving nor- 
mal, double and no tune-in advertising for a special TV program. 
Coincidental viewing measures indicated that the ads had no effect 
on audience size, but may have served to decrease audience variability. 


THE ADVERTISER who decides to hawk his wares on national television is committing himself to the expenditure of considerable sums of money. Anything he can do to increase the size or quality of the audience should be evaluated by him, and accepted or rejected on a sound cost basis. The advertiser spending half a million dollars to buy the attention of 20 million people would be foolish if he did not spend an additional ten thousand for one million more. However, the advertiser spending only 100 thousand for the same 20 million people might be foolish to spend an additional ten thousand for a five per cent audience increase.
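
The comparison in this paragraph can be sketched as cost per thousand viewers. The decision rule used here, buy the extra audience only if its cost per thousand does not exceed that of the base audience, is an assumption made for illustration; the text itself asks only for “a sound cost basis.”

```python
def cost_per_thousand(dollars, viewers):
    """Dollars spent per thousand viewers reached."""
    return dollars / (viewers / 1000)

def extra_audience_is_worth_it(base_cost, base_viewers, extra_cost, extra_viewers):
    """Assumed rule: the marginal cost per thousand must not exceed the base cost."""
    return (cost_per_thousand(extra_cost, extra_viewers)
            <= cost_per_thousand(base_cost, base_viewers))

# Half a million dollars for 20 million people, $10,000 for one million more.
big_spender = extra_audience_is_worth_it(500_000, 20_000_000, 10_000, 1_000_000)
# $100,000 for the same 20 million, $10,000 for five per cent (one million) more.
small_spender = extra_audience_is_worth_it(100_000, 20_000_000, 10_000, 1_000_000)

print(cost_per_thousand(500_000, 20_000_000), cost_per_thousand(10_000, 1_000_000))
print(big_spender, small_spender)
```

The first advertiser pays $25 per thousand for the base audience and $10 for the addition; the second pays $5 for the base audience, so the same $10 addition is relatively expensive.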

This, of course, assumes that the probability of 
purchasing (or behaving in some other desired 
manner) is the same for the last million as it was for 
the original 20 million—a completely ridiculous 
assumption. Obviously some people are far better 
targets for an advertising message than others. One 
may ask whether advertising is more efficient when 
it selects the “right” people rather than when it 
gathers a large number of viewers indiscriminately. 
This study does not attempt to answer this question. 

Rather, it asks: after a given expenditure for time and talent, what can an advertiser do to increase viewing? Several avenues are open. He can improve the quality of his program (assuming that the expenditure of additional monies will do this). He can advertise his program in spot television messages, in newspapers, in magazines or anywhere he chooses. The purpose of this study is to evaluate only one of these possible methods for increasing audience size: advertising the show’s presence on the television page of newspapers the day of the show.

The logic behind promoting a special television program seems infallible. People can’t get into a regular habit of watching such a program so they must be made aware of it. Awareness comes either through their own volition (actively searching television listings) or by having it thrust upon them. The latter can happen either by their accidentally tuning in the program, or by noticing some form of announcement that the program will be shown at a given time. The same reasoning may hold partially for regularly scheduled shows, but it is not so compelling.

JAMES C. BECKNELL, JR. is a research psychologist in the Advertising Research Section of E. I. du Pont de Nemours & Company. He has worked for R. H. Macy as an executive trainee and assistant buyer, and for the A. J. Wood Research Corporation as a project director. He received a B.S. in economics from the University of Pennsylvania in 1952, an M.A. in general psychology from the University of Arkansas in 1957, and is presently completing work for the Ph.D. in psychology at the University of Pennsylvania. He is a member of the American Psychological Association and the American Statistical Association.

The post hoc experiment or analysis of historical 
data is one method often used for evaluation of a 
promotional technique. At best, post hoc experi- 
mentation can yield only hypotheses to be tested 
later in a completely designed study. This is espe- 
cially true in the case of unusual events such as TV 
spectaculars which may differ widely in appeal. 
Whether a spectacular was or was not advertised 
depended on a variety of reasons. If there were any 
reason other than random assignment of shows to 
the categories, “promoted” and “not promoted,” 
post hoc analysis is inappropriate. 

For this reason, the decision was made to conduct 
a field experiment to determine the effect of news- 
paper tune-in advertising on the Du Pont “Show 
of the Month” run in April 1960. 


METHOD 

Twenty-four cities where tune-in advertising is normally run for the “Show of the Month” (on the television page of the newspaper the day of the show) were selected and stratified according to newspaper circulation. Three of these cities were eliminated: New York and Chicago, where the pattern of advertising differed from that of the other cities, and the Philadelphia-Wilmington area because the Du Pont home office is located there. The rationale behind stratifying on newspaper circulation rather than size of city was the desirability of equalizing cost. Stratification could have been accomplished by the cities’ former viewing pattern of the show but this was rejected because of within-city variance.

After the stratification, cities were grouped by 
threes and randomly assigned to one of three ex- 
perimental conditions. Group A, which received the 
normal size tune-in advertisements (300 lines), con- 
tained the following cities: 


Detroit 
Minneapolis-St. Paul 
Pittsburgh 

Kansas City, Mo. 
Milwaukee 
Cincinnati 
Columbus 


Group B, which received no advertising, contained 
the following cities: 


Boston 

Atlanta 
Washington, D. C. 
Cleveland 

Denver 

Buffalo 
Seattle-Tacoma 
Houston 


Group C, which received a double size advertise- 
ment (600 lines) contained the following cities: 


San Francisco-Oakland 
Baltimore 

St. Louis 

Miami 

Indianapolis 
Dallas-Ft. Worth 


Because of the random process used for assignment, 
Minneapolis-St. Paul and San Francisco-Oakland 
were separated into different groups. Because of the 
newspaper overlap in these areas, San Francisco and 
St. Paul were swapped to obtain geographical har- 
mony, and the ensuing pairs were randomly reas- 
signed. These pairs are counted only as one ob- 
servation. 
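
The assignment procedure, stratifying by circulation, grouping by threes and randomizing within each trio, can be sketched as below. The city labels and circulation figures are hypothetical placeholders, since the paper does not report circulations, and the sketch does not reproduce the later swap of San Francisco and St. Paul, so its groups are equal in size whereas the actual groups contained seven, eight and six cities.

```python
import random

def stratified_assignment(circulation, conditions=("A", "B", "C"), seed=0):
    """Rank units by circulation, take them in threes, shuffle conditions within each trio."""
    rng = random.Random(seed)
    ranked = sorted(circulation, key=circulation.get, reverse=True)
    assignment = {}
    for start in range(0, len(ranked), len(conditions)):
        stratum = ranked[start:start + len(conditions)]
        order = list(conditions)
        rng.shuffle(order)
        for unit, condition in zip(stratum, order):
            assignment[unit] = condition
    return assignment

# Hypothetical circulations for 21 unnamed cities (not data from the study).
circulation = {f"city_{i:02d}": 1_000_000 - 40_000 * i for i in range(21)}
assignment = stratified_assignment(circulation)

counts = {c: sum(1 for v in assignment.values() if v == c) for c in "ABC"}
print(counts)
```

Because every trio contributes one city to each condition, the groups are balanced on circulation by construction, which is the cost-equalizing purpose given for the stratification.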

On April 21, 1960, the day of the Du Pont “Show of the Month” presentation of “Years Ago,” an advertisement (see Figure 1) was run in the newspaper which the company’s advertising department and agency normally considered to be the best buy in each city. That evening the American Research Bureau of Beltsville, Maryland made approximately 375 telephone calls during the “Show of the Month” in each of the 21 cities. An average of 66 per cent of the calls were completed in each city. The standard deviation around this 21-city mean was 4.7 per cent. The data of this study are based on these interviews. Previous ARB data on the November, December, January and March “Show of the Month” were utilized in making statistical adjustments and in determining the over-all variance estimate in the analysis.

Two hypotheses were specifically tested. They were stated in the null form:


1. There will be no differences in the mean viewing levels among cities which had no tune-in advertisements, 300-line tune-in advertisements, or 600-line tune-in advertisements.

2. There will be no difference in the variance estimates for the groups of cities which had no tune-in advertisements, 300-line tune-in advertisements, and 600-line tune-in advertisements.
RESULTS

Table 1 shows the mean proportions of television sets (of all television sets) reported to be tuned in to the April “Show of the Month” by ARB’s coincidental interviews. Also shown is the mean value for the previous four shows’ average score. There were no significant differences among the experimental treatments either during the test show or during the preceding periods. But the difference between the April show and the average of the previous shows was significant. It is interesting to note that the slight superiority of Group C (double advertising) in April is even more pronounced in the averages for the previous four shows when all groups received the same advertising. As shown in Table 1, ratios of the April ratings to the average ratings from the previous four shows bring out even more strikingly that there was no effect of newspaper tune-in advertising on total audience size. This was true not only of the proportion of all sets as shown, but also of the proportion of sets in use.


TABLE 1
MEAN PER CENT CLAIMED VIEWING

City Group    April Average    Four Shows’ Average    April/Average
B                 1357               16.5                 .817
A                 13.8               18.3                 .757
C                 15.6               19.1                 .822

In April cities in Group B received no tune-in advertising; Group A received regular advertising; Group C received double the regular advertising. The average ratings of the three groups in April did not differ significantly from one another. All three groups, however, received significantly lower average ratings in April than in the previous four months, when regular tune-in advertising had been applied to all of them.

To test the second hypothesis, the decision was made to compare the variance from each of the experimental groups against the best over-all estimate we had of variance for the population of cities. This was done by testing the variance estimate of each subgroup against the over-all variance estimate for the population of shows included in the average scores for the preceding four shows, plus the “normal” advertising condition for the April show. This test found that the variance between cities within the experimental group that received no tune-in advertising was significantly greater than the variance normally expected in this population. This same tendency was also seen in the other two experimental groups. However, because of a lack of degrees of freedom, it cannot be said to be significant.
FIGURE 1
THE TUNE-IN AD TESTED

DU PONT SHOW OF THE MONTH
TONIGHT
presents
ROBERT PRESTON
in a rollicking comedy
“YEARS AGO”
by Ruth Gordon
Starring
PEGGY CONKLIN
SANDRA CHURCH
as Ruth Gordon Jones


The hilarious story of a would-be actress, her explosive 
father, her gentle mother—and a gift that changed their lives. 


Produced by David Susskind
Directed by Alex Segal


“SPECIAL” 8-9:30 TONIGHT Ch. 2 
TABLE 2
VARIANCE OF RATINGS AMONG CITIES

Advertising Level    No. of Cities    Variance
None                       8            39.5
Regular                    7            22.5
Double                     6            13.6


The variance of ratings among cities with no 
tune-in advertising was significantly (P >.99) 
greater than the variance (11.5) among cities with 
regular advertising. To calculate this variance of 
cities with regular advertising we have considered 
all 21 cities for each of the previous four shows as 
providing independent ratings. To them we added 
the ratings of the seven cities which received regular 
advertising in April. 
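
As a rough illustration of the comparison, each April group variance from Table 2 can be expressed as a ratio to the over-all variance of 11.5. The paper does not name the test statistic it used, so treating the comparison as a simple variance ratio is an assumption of this sketch, and no significance levels are computed.

```python
GROUP_VARIANCE = {"none": 39.5, "regular": 22.5, "double": 13.6}  # Table 2
OVERALL_VARIANCE = 11.5  # previous four shows plus the April regular group

# Ratio of each April group variance to the over-all variance estimate.
variance_ratio = {
    level: round(variance / OVERALL_VARIANCE, 2)
    for level, variance in GROUP_VARIANCE.items()
}

for level, ratio in variance_ratio.items():
    print(level, ratio)
```

The ratio is largest for the cities with no tune-in advertising and falls as the advertising level rises, which is the pattern the text describes.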

The “Show of the Month” began at 8:30 p.m. in 
some cities and 9:30 p.m. in others. The average 
rating for the 9:30 group (16.5) was significantly 
greater than that for the 8:30 group (9.4). The vari- 
ous advertising conditions, however, were fairly well 
distributed over the two time zones. 


DISCUSSION 


On this evidence hypothesis one cannot be rejected. 
That is, there were no data which indicated any re- 
lationship between tune-in advertising and the level 
of television viewing. 

Hypothesis two, however, must be rejected. It 
was a surprising finding that the variances of the rat- 
ings in the three groups of cities differed. We can 
offer one speculative explanation of this which re- 
quires some further investigation. 

The ratings in each city are samples of the pro- 
portion of all homes viewing, drawn at various in- 
stants in time; the rating refers to an interval made 
up of many such instants. A rating of 20 per cent, 
for example, refers to a time interval in which 
many people tuned in, many tuned out and an av- 
erage of 20 per cent viewed the show. 

Now it would be a great simplification to suppose 
that a TV audience is made up of only two classes 
of people: sustained viewers and knob-twiddlers. 
More accurately we could characterize the TV au- 
dience as consisting of people who are retuning at 
different rates. If all of them were sustained viewers then there would be no variation in the ratings over time; if all of them were active knob-twiddlers the ratings would vary considerably from one moment to the next. We must remember, too, that this rate of knob-twiddling together with the characteristics of the show will determine the show’s rating.

In those cities without tune-in advertising the usual full spectrum of tuning habits may be in existence. The “Show of the Month” gets an average rating of about 14 per cent but the actual percentage of viewers at various instants of time oscillates around this figure as people tune in and tune out. The ratings from eight such cities reflect this variance in each separate rating.

With tune-in advertising, however, some of the knob-twiddlers are saved from twiddling; some determine in advance to become sustained viewers, others are repelled beforehand. In short there are now fewer twiddlers left to twiddle; consequently the ratings will show less moment-to-moment variation. This in turn reflects itself in less variance between the ratings drawn from those cities subjected to tune-in advertising.

Apparently, advertising brings people into the 
audience who would like to see your show, but at 
the same time it screens out others who may not. If 
an advertiser can pre-judge the popularity and 
drawing power of the show he is presenting on tele- 
vision, then these results indicate the following. If 
the show is quite popular and a large proportion of 
the viewing public should want to see it, then 
tune-in advertising should increase the total au- 
dience. If a show is judged to be unfamiliar to the 
public and the title has little drawing power, the 
wisest course of action may be not to advertise and 
let chance viewing tendencies take over. 


CONCLUSIONS 

Tune-in advertising had no effect on the number 
of sets viewing a low-rated “Show of the Month.” 

We infer that tune-in advertising does select an 
audience—both by attracting viewers and by driv- 
ing them away. It may be a highly effective method 
of getting an audience who want to see your pro- 
gram, and eliminating people who are “trapped” 
into seeing it. 

We further infer that tune-in advertising may ac- 
tually be wasteful if you wish to expose your adver- 
tising message to individuals who would not nor- 
mally watch the show. 
FEDERAL STATISTICS
IN ADVERTISING 


INGRID C. KILDEGAARD
ARF Research Statistician 


SOUND ADVERTISING STRATEGY requires prediction.
What sales do you expect from impulse buyers,
from buyers who plan ahead, from those who merely
need replacements? How many are predisposed buyers
to be directed to your brand, and how many must
also be directed to your product group? The answers
rarely exist, so you still have to guess.

But if you know how many 
people own your brand and how many say they in- 
tend to buy it, your predictions are bound to im- 
prove. Happily, some of this latter information is 
available from the Bureau of the Census. 


Consumer Possessions 

A rich vein in the 1960 Census gold mine will be 
the data on household possessions. The Bureau 
plans to release 51 advance reports—for 50 states 
and the District of Columbia—on the number of 
owning households by state, county and standard 
metropolitan statistical area. While the informa- 
tion was collected from a sample in the 1960 Census 
of Housing, the sample size is adequate for provid- 
ing reliable estimates for counties over 2,000 in 
population. 

Here are the nine possessions to be reported: 

Air conditioning (one room unit, two or more room 
units, central system, none) 

Automobile available (one, two, three or more, none) 

Clothes dryer (gas heated, electrically heated, none) 

Clothes washing machines (wringer or spinner, auto- 
matic or semi-automatic, washer-dryer combination, 
none) 

Heating equipment (steam or hot water, warm air 
furnace, floor wall or pipeless furnace, built-in 
electric units, other means with flues, other means 
without flues, none) 

Home food freezer (one or more, none) 

Radio sets (one, two or more, none) 

Telephone available (yes, no) 

Television sets (one, two or more, none) 

These county estimates will be of obvious value 
to the advertiser of any of these products (or one 
closely related, as is packaged bleach to washing 
machines). Knowing market saturation, he can as- 
sign sales potentials, apportion advertising expendi- 
tures and plan promotion according to the needs 
and opportunities of individual areas. The ownership
data will help the researcher who wishes to select
sets of comparable areas for advertising experiments.

Of more general interest will be the information 
on radio and television ownership by area. Adver- 
tisers who buy or conduct telephone surveys will 
find the estimates of telephone households of espe- 
cial interest. 

The reports will be issued from May through 
October 1961. Called Advance Reports of Selected 
Equipment and Facilities, 1960 Census of Housing, 
Series HC (A4), they can be ordered from the Su- 
perintendent of Documents, U. S. Government 
Printing Office, Washington 25, D. C. 


Consumer Buying Intentions 

More consumers said they were going to buy automobiles 
and television sets within six months in October 1960 than 
in the previous July. On the other hand, fewer said they 
planned to buy washing machines and refrigerators. These 
differences could not be attributed to changes in consumer 
finances, for reports of past and expected changes in income 
were comparable for the two months. 

These interesting findings are from the Quarterly 
Survey of Consumer Buying Intentions, conducted
by the Bureau of the Census for the Federal Reserve
System, and reported in the Federal Reserve
Bulletin. Begun in January 1959, the survey reports 
consumer intentions and attitudes toward purchases 
of durable goods. The first year covered intentions 
and actual purchases of new and used cars, tele- 
vision sets, refrigerators and washing machines. In 
1960 the list was expanded to include houses, air 
conditioners, clothes dryers, dish washers, and 
phonograph and radio equipment costing over $100. 

In brief, survey results show seasonal patterns in 
buying plans, with greater variation between quar- 
ters than between years. Comparisons of buying 
plans and purchase rates show differences by in- 
come and age groups, though seasonal patterns are 
apparent within each group. The incidence of 
purchase among planners is several times greater 
than among non-planners. But most consumers be- 
long to the non-planning group and their pur- 
chases account for the greater proportion of the 
total. How closely plans and purchases correspond 
depends upon the item considered, the time inter- 
val specified and the planner’s degree of certainty. 

While it would be naive to translate intentions 
directly into sales, there may be relationships be- 
tween sales and intentions which permit the use of 
these data for predictive purposes. Knowledge of 
such relationships could aid the advertiser im- 
measurably. 
ON METHODS


GWYN COLLINS
ARF Research Mathematician


This note is prompted by a recent issue of the 
Public Opinion Quarterly¹ which was entirely devoted
to “attitude change.” It consisted of eight papers
on this and related topics contributed by psychologists
and sociologists distinguished in this
field. The contents of the Public Opinion Quar- 
terly, which will be familiar to many readers, are 
literary rather than analytical in temper, speculative 
rather than empirical in content. We will consider 
an alternative approach to this same subject based 
more on the daily necessity of using measures of at- 
titude change than by the desire to organize the 
concept of attitude change into some theoretical 
structure. 

First, and obviously, attitude change implies the 
measurement of attitudes on at least two occasions. 
For this reason, we will be mainly preoccupied with 
the problems of measuring attitudes and giving 
some meaning to what we measure. There are many
definitions of attitude current. These are just a few:


A neuropsychic state of readiness for mental and physical 
activity. 

The specific mental disposition toward an incoming (or 
arising) experience, whereby that experience is modified; or, 
a condition of readiness for a certain type of activity. 

A mental and neural state of readiness organized through
experience, exerting a directive or dynamic influence upon 
the individual’s response to all objects and situations with 
which it is related. 

In day to day practice, of course, this kind of 
definition does not help very much. It represents 
what the psychologist would like to measure or 
thinks that he measures; not what he does measure. 
These “neuropsychic states,” “mental dispositions” 
and “neural readinesses” have no physical existence 
but are inferred from some kind of observation. 
Since they convey no information that the observation
itself does not, we can, without loss, concentrate
on the observation itself and eschew psychological
inferences about what we think our measure
“means.” Rather we will apply ourselves to what it 
will do for us. 


¹ Volume 24, Number 2, Summer 1960.
Attitude scales are used commercially because they
enable us to predict an individual's course of be- 
havior. If a scale is found to be unrelated to the 
form of behavior we want to predict, we quickly 
abandon it. By the same token, we choose of several 
possible attitude scales that which enables us to 
make the surest prediction. 

It may be that a certain form of question, a spe- 
cial kind of interview, or an ingenious kind of in- 
terview situation exerts a strong intuitive appeal. 
We might believe that one or other of these will 
give us insight into what the respondent “really” 
feels, “really” wants or is “really” thinking. But the 
aim of our scale is not to satisfy the intuition but 
to give high probabilities to predictions. “Depth” 
interviews, projective techniques and other moti- 
vational devices may well be powerful tools but 
their usefulness must be demonstrated by a close 
relationship between the information they produce 
and the behavior in which we are interested. These
relationships have to be established (by some kind 
of correlational technique) whatever the source of 
the attitude information and whatever kind of be- 
havior is to be predicted. This is why the use of 
special techniques such as “motivational” research 
does not enable us to work with any fewer respond- 
ents than we would use in more conventional 
surveys. 

In the past measurements of attitudes have been 
taken with regard to such diverse entities as Ne- 
groes, the Democrats, war, and Campbell’s Soup 
and it has not always been quite clear what kind of 
behavior these scales sought to predict. Some may 
have been devised with the conscious intention of 
expanding an already inflated literature about at- 
titude scaling, though in general their authors 
would indignantly deny that their scales were un- 
related to any form of behavior. 

Frequently it would be claimed that the scale 
was not intended to measure behavior at all, but to 
measure an attitude that was an underlying source 
of behavior. This last is an unfortunate phrase for 
it suggests that human behavior as we observe it is
a second class phenomenon; a poor reflection of 
underlying psychological factors. It puts the cart 
before the horse. Behavior, whether the purchase of 
a brand, the answer to a question, or a rating of a 
preference is something that we can observe and 
study. The underlying “motive” is rather an elab- 
orate name for an observed correlation between 
different aspects of behavior. 

In what follows we will assume that the user of 
an attitude scale is interested in predicting some- 
thing very specific, usually purchasing behavior. 
With this in mind we will consider some of the 
scales that have been or could be used. We will con- 
sider two broad kinds of scales. One kind has been 
derived from theorizing about the underlying vari- 
able being measured. Such scales tend to rely on 
mathematical manipulations and are frequently 
awkward in construction, tedious in application. 
They have long been favorites of the psychologists. 
The other kind just happen to work. The better to 
appreciate the delights of the latter, we will first 
spend some time with the complexities of the 
former. 


Paired Comparisons 

Probably the most elementary of all scales is 
generated by asking the respondents to compare 
several products, two at a time. The procedure is so 
simple that it is often not regarded as an attitude 
measure at all but even this method reveals some 
intricate problems. 

Suppose the object of using this procedure is to 
predict the relative sales of each product. It is rea- 
sonable to hypothesize that those who prefer one 
product to all others will be more likely to pur- 
chase it and, in fact, we might expect the franchise 
of each brand to be equal to the proportion of peo- 
ple preferring it above all other brands. But life 
being what it is, some brands are not available in 
some places, some people change their minds and 
the position becomes a little more complicated. For 
this reason we become interested in people’s second, 
third and other choices in order to improve our 
predictions.

Suppose now that we ask the first respondent 
about only four products A, B, C and D. If he 
prefers A to B we will denote it by A > B. His results
might be A > B; A > C; A > D; B > C; B > D; C > D.
The judgments of this respondent enable us to 
put his preferences for the four products in rank
order: A > B > C > D. We know, or we can usu- 
ally show, that such a respondent will be more 
likely to buy A than its competitors, and if he does 
not buy A, is more likely to buy B than either C or 
D. Moreover, our degree of interest in his prefer- 
ences between B, C and D will vary with his likeli- 
hood of buying one of these rather than A. In 
short, we are interested not only that he prefers A 
to B but also want to know how much he prefers A 
to B. 

A difficulty that sometimes occurs in practice
with this paired comparisons method is that a re- 
spondent’s judgments are not consistent. He might, 
for instance, prefer A to B and B to C yet perversely 
prefer C to A. One reason could be that in compar- 
ing A with B one dimension common to both, say 
size for example, tends to stand out and his judg- 
ment is made on the degree to which they each pos- 
sess this attribute. The same dimension may also 
be the most obvious for making judgments between
B and C but when A and C are compared the
outstanding quality on which they differ may be an 
entirely different dimension, say color, and this
could make for inconsistencies in judgments. 
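
The inconsistency just described can be looked for mechanically. The following is a minimal sketch in Python, not an established procedure: it takes one respondent's paired judgments and lists every circular triad (A over B, B over C, yet C over A). The function name and the data layout are assumptions made for illustration.

```python
from itertools import combinations, permutations

def circular_triads(prefers):
    """Return the circular triads in one respondent's paired judgments.

    `prefers` is a set of (winner, loser) pairs, one per pair of products.
    """
    products = sorted({p for pair in prefers for p in pair})
    found = []
    for trio in combinations(products, 3):
        # A trio is circular if some ordering x > y > z also has z > x.
        for x, y, z in permutations(trio):
            if (x, y) in prefers and (y, z) in prefers and (z, x) in prefers:
                found.append(trio)
                break
    return found

# The consistent respondent of the example, and a hypothetical "perverse" one.
consistent = {("A", "B"), ("A", "C"), ("A", "D"),
              ("B", "C"), ("B", "D"), ("C", "D")}
perverse = {("A", "B"), ("B", "C"), ("C", "A"),
            ("A", "D"), ("B", "D"), ("C", "D")}

print(circular_triads(consistent))  # []
print(circular_triads(perverse))    # [('A', 'B', 'C')]
```

A respondent with no circular triads can be given a single rank order; one with several is probably judging on more than one dimension.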

Since the researcher is interested in relating his 
scale to one aspect of behavior only, say probability 
of purchase, the existence of more than one dimen- 
sion of judgment is a nuisance. Often it can be 
avoided by asking the subject to rank order the 
products by considering them all at once rather 
than pair by pair. This, of course, does not remove 
the problem of having two dimensions mixed up in 
our scale but it does force each respondent to use 
only one dimension (though different respondents 
may use different ones) and often reveals two or 
more dimensions at the pilot stage of a survey by a 
respondent’s complaints about the difficulties that 
he has in performing the ranking. 


Rank Order 

The kind of procedure we have described only
succeeds in putting products in rank order. From 
it we can have no idea whether the choice is a mere 
toss-up or the difference between night and day. 
We can give numbers to the rank order scale we 
have described by giving the product we like best 
one point, the next two points, and so on. But in 
assigning such numbers we may forget that though 
the difference between 3 and 4 is the same as be- 
tween 1 and 2 the difference between product A 
and B might seem to the respondent very great 
compared with the difference between C and D. 
This raises an issue of great importance in attitude
scaling generally; the assignment of numbers
to preferences or degrees of favorableness and un- 
favorableness. The number system with which we 
are familiar from elementary arithmetic has sev- 
eral properties which we have long taken for 
granted. Scales of measurement, particularly attitude
scales, do not always have them and arithmetical
manipulations with attitude scales can sometimes
be misleading. Three special properties of
numbers, as we know them from our elementary 
arithmetic, serve to distinguish different kinds of 
scales. 


Three Kinds of Scales 

First, the number system is ordered; any one 
number is always greater than, equal to, or less than 
each other number. This is the property which en- 
courages us to attach numbers to things which can 
be put in rank order. A scale which goes along with 
the number system this far is usually described as 
an ordinal scale. 

Second, the interval between any two numbers is 
itself a number. Because these intervals are num- 
bers they must be ordered (by the first property) 
and this means that the intervals between numbers 
can be compared with one another. Scales with this 
second property are often called interval scales. 
Measurements on this kind of scale are like points 
on an endless string. We can tell which order they
are in, we can tell the distance between them and 
we can even find the average positions of groups of 
points, but we cannot tell how far any of them are 
from the beginning of the string. 

Then some scales of measurement such as those 
for weight and length are such that the ratio be- 
tween any two numbers on the scale is a number. 
This makes it possible for us to speak of one weight 
as being twice as heavy as another, of one length 
as four times as long as some other length. It can be 
shown that scales of this type, generally known as 
ratio scales, always have a zero point corresponding 
to the total absence of the quantity being measured. 

Returning to our example of rank order scales, 
we see that they are ordinal but not interval scales. 
With them we cannot sensibly add the ranks given 
to products by several respondents. If we try we 
are misled. Suppose, for example, that we have only 
two respondents and they rank the four products of 
our example like this: 


1st Respondent: A > B > C > D
2nd Respondent: B > C > D > A

Now the actual probability of the two respondents
buying the various products at the next purchase,
which we usually will not know, could be
like this:

[Diagram: a scale running from “Certain not to buy” at the left to “Certain to buy” at the right. The 1st Respondent’s products fall along it in the order D, C, B, A; the 2nd Respondent’s in the order A, D, C, B.]

If we assign scores to the ranks, say 4 to the first
rank, 3 to the second, 2 to the third and 1 to the last,
we would, on adding the points for each product,
get:

A   5
B   7
C   5
D   3

This will make it look as though B is more likely
to be purchased than A when from our diagram,
which contains information we usually do not have,
we know that this is not the case. 
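
The arithmetic can be sketched in a few lines of Python. The rankings are taken to be A, B, C, D for the first respondent and B, C, D, A for the second, the orders implied by the diagram. The purchase probabilities are purely hypothetical figures chosen to agree with those orders; they do not come from any survey.

```python
def rank_sums(rankings, top_score=4):
    """Add rank scores across respondents: top_score for a first choice, one less for each rank below."""
    totals = {}
    for ranking in rankings:
        for position, product in enumerate(ranking):
            totals[product] = totals.get(product, 0) + (top_score - position)
    return totals

rankings = [
    ["A", "B", "C", "D"],  # 1st respondent (order assumed from the diagram)
    ["B", "C", "D", "A"],  # 2nd respondent (order assumed from the diagram)
]
totals = rank_sums(rankings)
print(totals)  # {'A': 5, 'B': 7, 'C': 5, 'D': 3}

# Hypothetical purchase probabilities consistent with the two rank orders.
purchase_prob = [
    {"A": 0.90, "B": 0.30, "C": 0.20, "D": 0.10},
    {"A": 0.10, "B": 0.25, "C": 0.20, "D": 0.15},
]
mean_prob = {p: sum(r[p] for r in purchase_prob) / len(purchase_prob)
             for p in "ABCD"}
# The rank sums favor B, yet the average probability favors A.
print(totals["B"] > totals["A"], mean_prob["A"] > mean_prob["B"])  # True True
```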

Several attempts have been made to make rank 
order or paired comparisons scales into interval 
scales so that the answers from different respondents 
can be averaged. To do this we must find some way 
of ranking the intervals between scores in magni- 
tude. One method is suggested by what is known 
as Thurstone’s Law of Comparative Judgment. It 
states that in assigning scale positions to two prod- 
ucts, A and B, the size of the interval between them 
will be the product of two factors: the first depends 
on the frequency with which A is preferred to B; the
second is a function of the variability of judgments
about A, the variability of judgments about B, and 
of the way in which judgments about A and B are 
related. Some assumptions are usually made to sim- 
plify this Law. First, the variability in judgments 
about each product is assumed to be the same; sec- 
ond, it is assumed that the common factors, if any, 
present in making pairs of such judgments always 
operate to the same extent. 

The effect of these assumptions is that the Law 
of Comparative Judgment reduces to a rule, 
namely, the distance between the scale positions of 
two products A and B will depend on the per- 
centage of judgments in which A is preferred to B. 
The distance between them is actually derived in 
this way: first, the block of proportions showing 
how often each product is preferred to each other 
is written out like this: 
where the figures in the table show how often the
product at the top of the table is preferred to the 
product at the side of it. 

Each of these proportions is then converted into 
the deviates of the normal distribution correspond- 
ing to them. The table then becomes: 


                         A       B       C       D
A                              -.05    -.79   -1.65
B                       .05            -.30   -1.13
C                       .79     .30            -.39
D                      1.65    1.13     .39
Sum of columns:        2.49    1.38    -.70   -3.17
Average of columns:    0.62    0.35    -.18    -.79
(= scale value)


and the scale values are calculated as indicated in 
the second table. These values merely indicate the 
distance between scale points and we can, in the 
above example, add .79 to each to make all of 
them positive, for we can choose the zero point in a 
completely arbitrary way. 
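
As a rough check on the table, the column averages can be recomputed in Python. This is only a sketch of the simplified rule: the deviates are copied from the table, the diagonal is taken as zero, and the variable names are illustrative. The last lines show how a deviate corresponds to a proportion, using `NormalDist` from the standard library.

```python
from statistics import NormalDist

products = ["A", "B", "C", "D"]
# Normal deviates from the table: entry [row][col] is the deviate for the
# column product being preferred to the row product (diagonal set to 0).
deviates = [
    [0.00, -0.05, -0.79, -1.65],
    [0.05,  0.00, -0.30, -1.13],
    [0.79,  0.30,  0.00, -0.39],
    [1.65,  1.13,  0.39,  0.00],
]

def scale_values(z):
    """Scale value of each product = average of its column of deviates."""
    n = len(z)
    return [sum(z[r][c] for r in range(n)) / n for c in range(n)]

values = scale_values(deviates)
# The zero point is arbitrary, so the lowest product can be moved to zero.
shifted = [v - min(values) for v in values]

for name, v, s in zip(products, values, shifted):
    print(f"{name}: {v:+.3f}  shifted {s:.3f}")

# A deviate and its proportion: 0.79 corresponds to roughly 79 per cent.
p = NormalDist().cdf(0.79)
z = NormalDist().inv_cdf(p)
```

Only the differences between the scale values carry meaning; the shift changes none of them.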

That the difference between the scale position of 
two products should depend on the number of 
times one is preferred to the other seems a rea- 
sonable assumption to make. If all comparisons are 
made on the same dimension then when an overwhelming
number of people prefer A to B we can
guess that the difference in their market popularity 
will be great. But when there is only a slight pref- 
erence for A over B we might feel that there will be 
little difference in their market popularity. (We 
might go a stage further and guess that products A 
and B were so much alike that respondents had a 
hard time distinguishing between them and chose 
one rather than the other largely by chance. To be
sure about this we would want to separate those 
who preferred A to B from those who preferred B 
to A and to consider whether we were dealing with 
two different kinds of people with different kinds of 
characteristics.) 

This gives us a way of showing preferences for 
several products or different brands of a product 
field on a scale. It shows us not only which is pre- 
ferred but by how much. But it must be empha- 
sized that if this scale does not correlate with actual 
behavior of some kind, e.g., dollar market share, it 
will not be useful. Now this is something to be de- 
termined by experiment, probably in several prod- 
uct fields. All that can be said for the scale in the 
absence of such experiments is that intuitively it 
commends itself as worth trying and as more rea- 
sonable than the rough practice of adding ranks 
together. 
Equal Appearing Intervals

The same thoughts which gave rise to the Law of 
Comparative Judgment have inspired two other 
kinds of attitude scaling methods, both very popu- 
lar in the psychological literature: the method of 
equal appearing intervals and the method of successive
intervals. These methods are more often
used to describe the chance of individuals exhibit- 
ing some kind of behavior than they are to describe 
the chance of a product attracting it. Both methods 
involve a two-stage process. 

In both, a large number of statements are col- 
lected, all of them about the object of the behavior 
in which we are interested. A number of respond- 
ents, usually 50 or more, are asked to sort each 
statement into one of seven, nine or eleven piles 
ranging in degree of favorableness to some kind of 
behavior. The sorters are instructed to make the 
difference in favorableness between consecutive 
piles appear equal. In the method of equal appear- 
ing intervals the scale value of each statement is the 
average of the pile numbers in which it is found. 
The attitude scale itself is made up of a selection of 
some ten to 20 statements representing a wide range 
of scale values and each of which was given a fairly 
small dispersion between different piles. The scale 
is then ready to take into the field and respondents 
are given scale scores equal to the average scale 
values of all the items with which they agree. 
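
The two stages of the method of equal appearing intervals might be sketched as follows. All the data are hypothetical, and the use of a standard deviation with a cut-off of 1.5 piles to define a "fairly small dispersion" is an assumption of this sketch, not part of the method as described.

```python
from statistics import mean, pstdev

# Hypothetical sorting data: statement -> pile numbers (1 to 11) given by judges.
pile_sorts = {
    "s1": [2, 3, 2, 3, 2],
    "s2": [6, 5, 6, 7, 6],
    "s3": [10, 9, 10, 10, 11],
    "s4": [1, 6, 11, 4, 8],   # widely scattered between piles
}

def build_scale(sorts, max_spread=1.5):
    """Stage one: scale value = mean pile number; keep statements with a small spread."""
    return {s: mean(piles) for s, piles in sorts.items()
            if pstdev(piles) <= max_spread}

def respondent_score(scale, agreed):
    """Stage two: average scale value of the statements a respondent agrees with."""
    return mean(scale[s] for s in agreed if s in scale)

scale = build_scale(pile_sorts)
score = respondent_score(scale, ["s2", "s3"])
print(scale)
print(score)
```

Statement s4 is dropped because the judges could not agree where it belongs; a respondent agreeing with s2 and s3 is scored at the average of their scale values.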

The big question, of course, is whether the first 
set of respondents can really put a series of state- 
ments into piles such that assent to them repre- 
sents equally spaced out probabilities of, say, pur- 
chasing action. If they cannot, there is no reason to 
regard this kind of scale as an interval scale. 

The method of successive intervals is similar to 
that of equal appearing intervals but goes one bet- 
ter. It differs from it only in the way in which the 
scale scores are computed for each statement. In the 
method of successive intervals, the scale distances 
between the items are defined in terms of the way 
in which they were spread between the piles when 
sorted by the first set of respondents. 


Rating Scales 
So far we have considered scales which have been 
constructed with some kind of theory about an un- 
derlying continuum in mind. We now turn to 
scales which are used simply because they are often 
found to work. These are much simpler in con- 
ception and generally easier to administer. 
One such is the Likert scale. To construct it we
again collect a large number of statements, all of 
which seem to be related to the action we want to 
predict. A first set of respondents is invited to ex- 
press some shade of agreement or disagreement with 
each statement. Usually they are offered such cate- 
gories as: strongly disagree, disagree, don’t know, 
agree and strongly agree. These response categories 
are then scored, say, -2, -1, 0, 1, 2 and each respondent’s
score is totalled. Each respondent thus
has a score for each item and for all the items to- 
gether. At this stage any one of several processes is 
used to select those items whose scores closely relate 
to the total of all item scores. These items then con- 
stitute the scale and it is ready to be taken into the 
field. 
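
One of the several possible selection processes is to correlate each item with the total score and keep the items that correlate well. The sketch below assumes that process; the pilot data are hypothetical and the cut-off of 0.5 is an arbitrary choice for illustration.

```python
from math import sqrt

def pearson(x, y):
    """Product-moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

# Hypothetical pilot data: rows are respondents, columns are items scored -2 to +2.
responses = [
    [ 2,  1,  2, -1],
    [ 1,  1,  2,  2],
    [ 0,  0,  1, -2],
    [-1, -1,  0,  1],
    [-2, -2, -1,  0],
]

def select_items(rows, threshold=0.5):
    """Keep the items whose scores relate closely to the total of all item scores."""
    totals = [sum(r) for r in rows]
    keep = []
    for j in range(len(rows[0])):
        item = [r[j] for r in rows]
        if pearson(item, totals) >= threshold:
            keep.append(j)
    return keep

print(select_items(responses))  # [0, 1, 2]
```

The fourth item moves almost independently of the total and is dropped. Whether the total itself predicts behavior is a separate question, which this selection cannot answer.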

By this procedure scales can be devised which 
correlate well with behavior, though not all scales 
produced in this way do. It can happen that the 
total score on all the items taken together is itself 
of little predictive value. In this case the items con- 
stituting the scale will probably be even less good. 
Passing over this unpleasant prospect we can ad- 
mire one of the scale’s happier characteristics; that 
of reliability. By this we refer to the way in which 
the same respondents tend to generate the same 
score on different occasions. A scale’s ability to 
correlate with some measure of behavior, which we 
often describe as its validity, obviously cannot be 
any better than its ability to agree with itself from 
one moment to the next. 

The Likert scale is only one of many which gen- 
erates numbers describing how a respondent feels 
about a product or an issue. Other examples, much 
more frequently used, ask the respondent about the 
product more directly. 

For example, we might ask the question, “How
do you feel about X? Do you approve strongly, approve,
don’t know, disapprove, or disapprove
strongly?” These answers can then be scored +2,
+1, 0, -1, -2. Again we might ask, “Give me a
number from +5 to -5 to show me how you feel
about X. If you like X, give it a plus number and
the more you like it the bigger the plus number you
should give it. If you dislike X, give it a minus number;
the more you dislike it the bigger the minus
number you should give it.”

These and other procedures generate a number
for each respondent, but generally the intention is
not to predict the behavior of an individual but to
predict the behavior of groups of them. To do this
we must somehow combine results derived from
many respondents and it is chiefly this need that
prompts us to assign numbers to respondents’ results.
But adding these numbers is misleading, as
we have already illustrated in discussing the method
of paired comparisons.

Sometimes researchers have preferred to work 
with verbal categories but this only exchanges one 
knotty problem for another. Consider, for example, 
a test of a commercial with “before” and “after” 
attitude measures of the product. It may provide
this sort of result: 


                    Before
Like strongly         25%
Like                  40
Don’t know            20
Dislike                5
Dislike strongly      10
                     100%


Did the commercial do the product any good? 
Even if the two attitude measures were performed 
on the same respondents, and if there were no con- 
ditioning effects arising from asking the same peo- 
ple the same questions, we should still find our- 
selves asking whether one respondent improved 
from “like” to “like strongly” was worth two de- 
teriorating from “don’t know” to “dislike strongly.” 

One way of bringing order out of this chaos is to 
relate the scale directly to the criterion variable of 
purchase. The following example illustrates this 
method. We will suppose that we are testing a com- 
mercial for some specific product and are using an 
eleven-point attitude scale to make the evaluation. 
For convenience we shall suppose that we have 1,000 
respondents. 


[Table: Category (+5 to -5) | Before | No. of users | % of users per category | After | Predicted no. of users. Surviving entries: 155, 54; TOTAL 1,000.]


In this method each of the categories from +5 to
-5 is regarded as qualitatively different: they
could equally well be identified by names for at no
stage do we add them together. In each category of 
the scale we find the number of purchasers of the 
product. We then assume that after the commercial 
the same proportion of people in each category will 
purchase the product. Thus in the table above, 
looking at the +5 category we find that 155, or 
44.1 per cent of respondents out of the 350 in this 
category use the product. After the commercial there 
were 458 respondents in this category and if we 
assume that 44.1 per cent of them will use the prod- 
uct this gives us a predicted 202 users. In the tab- 
ulated illustration the commercial resulted in a 
predicted increase of brand share from 24.3 per cent 
to 27.7 per cent at the times of measurement. The
method is obviously the same whether categories 
are verbal or numerical. 
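
The calculation for a single category can be sketched in Python. Only the +5 category is worked through here, with the usage rate of 44.1 per cent and the "after" count of 458 quoted in the text; the function and variable names are illustrative, and the remaining ten categories would enter the dictionaries in the same way.

```python
def predicted_users(after_counts, usage_rate):
    """Sum over categories of (respondents after) x (share of users before)."""
    return sum(after_counts[c] * usage_rate[c] for c in after_counts)

# Figures for the +5 category as quoted in the text.
usage_rate = {"+5": 0.441}    # share of the category using the product before
after_counts = {"+5": 458}    # respondents in the category after the commercial

users_plus5 = predicted_users(after_counts, usage_rate)
print(round(users_plus5))  # 202
```

Summing the predicted users over all eleven categories and dividing by the number of respondents gives the predicted brand share.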

This kind of treatment might work well when 
the number of users of the product is quite large. 
Very often, however, the product has a small brand 
share which makes it impossible to compute a de- 
pendable figure for the percentage of users in each 
category. In such cases the percentage of users per 
category might be derived from other brands in the 
same product group. Another difficulty arises when, 
in an example like ours, many respondents move 
into the +5 category not only for the brand in 
which we are interested but also for its competitors. 
Application of the method as described could lead 
to the total of predicted brand shares becoming 
greater than 100 per cent. This can be avoided by
using an attitude scale which limits the number of 
responses that a subject can make in a single cate- 
gory and which forces the respondent to rank order 
his favored choices. Asking the respondent to allo- 
cate a fixed number of points between all brands, 
and taking the number of points allocated to each 
as its scale value, appears a good device for this 
purpose.


Summary 

Some qualities that have been discussed must ap- 
pear in any attitude scale that is to be useful. First, 
it must have some consistency over time. If the scale 
is applied twice to the same respondent it should 
generate the same, or almost the same, result. This
kind of stability, or reliability, is a necessary but 
not sufficient quality in any measuring device. 

Then the scale should be valid, which is to say 
that it must relate to some observable behavior. 
But this is not just one simple quality; it has many 
aspects. The scale must not, for example, relate to 
two or more behaviors at a time for otherwise we 
can never tell which of the two is being measured. 

Since the scale is to relate to behavior it is usually 
desirable for the scale to finely discriminate be- 
tween groups of people who behave differently or 
have different probabilities of behaving in some 
way. This means there must be several scale posi- 
tions available for respondents though reliability 
of the scale will tend to fall off as the number of 
categories is increased. The rule is that the varia- 
tion in behavior between adjoining positions on a 
scale should be great compared with the variation 
of behavior within each category. 

Most attitude scales are used more than once on 
the same subject. If the procedure is too long, or 
difficult to understand, or repetitive about a brand 
or product, the respondent’s first answers will influ- 
ence his answers on the second application. 

All these points are aspects of one central
quality, one both necessary and sufficient for a use-
ful scale: that of relating to behavior. It is not nec-
essary that the questions used should appear ra-
tional, or that there should be an underlying
mathematical model. Such considerations merely
offer valuable hints in the construction of a measur-
ing device which is likely to work. The one final
test of an attitude measure is its ability to predict
behavior.


Science is nothing but trained and organized common sense, differing from the
latter only as a veteran may from a raw recruit, and its methods differ from those
of common sense only as the guardsman’s cut and thrust differ from the manner
in which a savage wields his club.
—THOMAS HUXLEY


RESEARCH IN REVIEW


GWYN COLLINS, THORNTON C. LOCKWOOD and GEORGE M. SHIREY
ARF Staff 


New Way to Spread 
Money on Media 


A. M. Lee and A. J. Burkart. Some Optimization Prob- 
lems in Advertising Media Planning. Operational 
Research Quarterly, Vol. 11, No. 3, September 1960, 
pp. 113-122.

This stimulating and useful paper may set a new 
trend in media allocation studies. 

Intended for an audience of operations research- 
ers, it is so phrased as to be unattractive to many 
media and research men. For them, the following 
account sketches some of the principal ideas. 

The authors have a campaign in which their ads 
appear only once in each magazine selected. They
were able, however, to select various amounts of
space in any of a number of such publications. 
First they sought to maximize frequency; then they 
tried instead to maximize reach. 

They assumed that every page of each magazine 
was equally likely to be read and furthermore, 
that the probability of anyone noting an ad on any 
page was equal to the square root of the proportion 
of the page occupied by it. (This “square root law,” 
recently referred to in a Japanese publication¹ as
“the well known square root law,” is based on some 
experimental evidence presented in a 1914 Psycho- 
logical Review.) Then it was assumed that readers 
of both magazines A and B are no more likely to see 
the ad in B from having previously seen it in A. 

The authors write down an expression for the 
number of people receiving 0, 1, 2, 3, etc., impres- 
sions from any fixed number of magazines. It is an 
ugly thing which we will take for granted. The 
number of people receiving no impressions is then 
written as $I_0$, the number receiving one and only
one as $I_1$, and so on. Each of these is obviously a
function of the number of magazines employed and 
the amount of space used in each. Messrs. Lee and 
Burkart then write down a simple formula for ad- 
vertising effectiveness. 


¹ Jiro Yamanaka. A Method of Prediction of Readership
Scores. Tokyo: Dentsu Advertising Ltd., 1960.

E (effectiveness) = $w_0 I_0 + w_1 I_1 + w_2 I_2 + \dots + w_n I_n$

where the w’s can be given any values we care to
attach to the receipt of no impressions, one impres-
sion, and so on up to n impressions. If we want E to
represent reach we simply set $w_0$ equal to zero and
all the other w’s equal to one. Similarly for fre-
quency we set $w_0$ equal to zero, $w_1$ equal to one, $w_2$
equal to two and, in general, $w_i$ equal to i.
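
The weighting scheme can be tried on a small case. The sketch below is only an illustration: it assumes that exposures in different magazines are independent, that each magazine is read by a given share of the population, and that the chance of noting the ad follows the square root law described above. The population size, readership shares, space fractions and function names are hypothetical and are not taken from Lee and Burkart.

```python
from math import sqrt


def impression_distribution(probs):
    """Return [P(0), P(1), ..., P(n)] impressions for independent exposures."""
    dist = [1.0]
    for p in probs:
        new = [0.0] * (len(dist) + 1)
        for k, mass in enumerate(dist):
            new[k] += mass * (1.0 - p)   # ad not noted in this magazine
            new[k + 1] += mass * p       # ad noted in this magazine
        dist = new
    return dist


def effectiveness(counts, weights):
    """E = w_0*I_0 + w_1*I_1 + ... + w_n*I_n."""
    return sum(w * c for w, c in zip(weights, counts))


population = 1000
readership = [0.5, 0.4]        # hypothetical share of the population reading each magazine
page_fraction = [1.0, 0.25]    # hypothetical space bought: a full page and a quarter page

# Square root law: P(noting) = sqrt(fraction of page), applied to readers only.
probs = [r * sqrt(s) for r, s in zip(readership, page_fraction)]
counts = [population * p for p in impression_distribution(probs)]  # I_0, I_1, I_2

n = len(probs)
reach = effectiveness(counts, [0] + [1] * n)            # w_0 = 0, all others 1
frequency = effectiveness(counts, list(range(n + 1)))   # w_i = i
```

With these figures 400 people receive no impression, 500 receive one and 100 receive two, so reach is 600 and frequency, counted as total impressions, is 700.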

Now maximizing reach, for example, is no prob- 
lem with an unrestricted budget. The authors list 
their constraints as: (a) no more than a full page to 
be used in each magazine; and (b) the total money 
to be allocated is fixed and it is assumed that ads 
of less than full page size will have pro rata cost. 

The allocation of fixed resources so as to maxi-
mize return, subject to a number of simple con-
straints, is now a fairly everyday procedure of op-
erations research. Electronic computer programs
may be bought ready-made for this purpose.

Maximizing average impressions, involving as it
does the total audience of the magazines, is a rela-
tively simple problem, for the equations necessary
have only as many terms as the number of mag-
azines considered. The solution provided by Lee
and Burkart is: for greatest average impression,
advertising space should be bought in each publi-
cation in inverse proportion to the square of its cost
per thousand.
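
The stated rule can be sketched as follows. This is an illustration of the rule as summarized here, not the authors' own procedure; the function name, costs, audiences and budget are hypothetical, and the sketch simply stops if the rule would call for more than a full page in any magazine.

```python
def allocate_space(full_page_costs, audiences_thousands, budget):
    """Page fractions proportional to 1 / (cost per thousand)^2, scaled to the budget.

    Assumes pro rata cost for less than a full page, as in constraint (b).
    """
    cpms = [c / a for c, a in zip(full_page_costs, audiences_thousands)]
    weights = [1.0 / cpm ** 2 for cpm in cpms]
    # Money spent if each magazine received exactly its weight in page fractions.
    spend_per_unit = sum(c * w for c, w in zip(full_page_costs, weights))
    scale = budget / spend_per_unit
    space = [scale * w for w in weights]
    if any(s > 1.0 for s in space):
        raise ValueError("this sketch does not handle the full-page cap")
    return space


# Two hypothetical magazines, each with 500 thousand readers.
costs = [1000.0, 2000.0]          # cost of a full page
audiences = [500.0, 500.0]        # audience in thousands
space = allocate_space(costs, audiences, budget=900.0)
spend = sum(c * s for c, s in zip(costs, space))
```

Here the second magazine costs twice as much per thousand and so receives one quarter of the space. The rule is consistent with the square root law: if total impressions are the sum of each audience times the square root of the space bought, then equalizing the marginal return per dollar across magazines gives space proportional to the inverse square of cost per thousand.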

The reach problem is a lot less tractable because
it involves unduplicated readership. To represent
it for three magazines an equation with seven terms
is necessary. For 15 magazines as used in this ar-
ticle the number of terms would be 32,767. This
difficulty could have been greatly eased if the au-
thors had had access to J.-M. Agostini’s work de-
scribed on pp. 11-14 of this Journal. Instead they
make some radical assumptions. These amount to
accepting that all the magazines have roughly equal
readership, that the readership of each magazine
overlaps the readership of each other in purely
chance fashion, and that duplications of higher
than the third order are fairly insignificant.
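
The term counts quoted above are the numbers of non-empty combinations of magazines, one term for each single audience, each pair, each triple and so on. A short sketch, with hypothetical function and magazine names, confirms the arithmetic:

```python
from itertools import combinations


def overlap_terms(n_magazines):
    """Number of non-empty combinations of n magazines: 2^n - 1."""
    return 2 ** n_magazines - 1


def list_combinations(names):
    """Enumerate every single, pair, triple, ... of the given magazines."""
    return [combo
            for size in range(1, len(names) + 1)
            for combo in combinations(names, size)]


three = list_combinations(["A", "B", "C"])   # 3 singles, 3 pairs, 1 triple
terms_for_15 = overlap_terms(15)
```

Each added magazine roughly doubles the count, which is why the authors fall back on simplifying assumptions.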

The authors carried out each calculation by
starting with the allocation for greatest frequency 
and then moved by successive approximations to 
the best allocation for reach. Surprisingly these 
solutions were very close to one another. This 
prompted the authors to explore a rule to deter- 
mine the conditions under which the solution of 
both problems became identical. Their conclusion 
is that if the magazines’ readerships overlap in a
chance way, and if the ratio of the square of the
readership of each to its cost per thousand is even
roughly equal for each of the magazines, then the 
optimum reach and frequency are obtained by 
buying space in each publication inversely propor- 
tional to the square of its cost per thousand, pro- 
vided each ad is no more than full page. 

The authors are now applying their ideas to a 
series of ads in one or more publications, decay and 
acquisition effects, television, combinations of print 
and TV, and the spacing of ads and commercials 
over time. —GWYN COLLINS


Now You See It, 
Now You Don’t 


Edwin B. Parker. Subliminal Stimulation and Voting 
Behavior. Journalism Quarterly, Vol. 37, No. 4, 
Autumn 1960, pp. 588-590. 

The literature on subliminal stimulation (sub- 
liminal perception, subception, learning without 
awareness, etc.) has devoted a great deal of atten- 
tion to attitudes and behavior generated from so- 
called subliminal stimulation, but considerably less 
to the stimulus itself. 

Any stimulus may be defined in terms of its physi- 
cal properties such as duration, intensity and fre- 
quency. However, as ARF’s The Application of 
Subliminal Perception in Advertising points out, 
the limen (threshold) is a statistical concept usually 
defined as that stimulus level at which perception 
is claimed by 50 per cent of the group exposed. 
Therefore a subliminal stimulus must be less than 
that level, but presumably perceptible by at least 
some of the group. 

Furthermore, perception of a subliminal stimulus 
varies among different individuals, and with the 
same individual at different times. Perception of 
communication also depends upon the individual's 
psychological predisposition. For example, visual 
messages transmitted subliminally by television 
would obviously have little effect, regardless of 
their duration, intensity or frequency, on those re-
spondents only listening to television, i.e., reading
a book or out of the room. 

Recent reports of subliminal perception have not 
always described satisfactorily the stimulus in the 
response terms mentioned above. Reporting a stim- 
ulus as subliminal does not make it so. A stimulus 
may be so subliminal as to be imperceptible to all. 
Writers on the subject agree that stimulation can 
be considered subliminal for any given respondent, 
if the individual is not “consciously aware” of the 
stimulus, but do not agree as to what constitutes 
“conscious awareness.” 

In this article the author’s experimental design 
provided for four groups to be exposed “sublim- 
inally.” Two were motivated by being told that a 
film, together with a trait list for each of two candi-
dates, provided the minimal information needed to
distinguish between an excellent and a potentially 
dangerous school board member. All four were told 
that the film and trait lists were being studied to 
find out what propaganda effects could be attrib- 
uted to them. Half the motivated and control 
groups were exposed to a “Vote G” message, and 
half to a “Vote H” message. The design apparently 
precluded any control of the level of subliminality. 

The author disposes of this question easily 
enough: the stimulus, he reports, is a “subliminal 
message.” He gives its physical dimensions: bulb 
and projector wattage, shutter speed and aperture. 
The author states that the “stimulus intensity per- 
mitted the message to be seen occasionally by the 
experimenters. None of the subjects reported seeing 
anything unusual during the experiment.” How 
they reported this is not specified. He summarizes: 

These results, coupled with the negative results of other 
researchers, lead to the conclusion that subliminal stimulation 
is at best an inefficient means of communication and that 
techniques of subliminal stimulation are unsuited for the pur- 
poses of mass persuasion. This conclusion must remain tenta- 
tive, however, since it is always conceivable that someone 
will be able to find some conditions under which it will 
work effectively. But until such conditions are found we have 
a straightforward reason, without raising any ethical ques-
tions, for not using subliminal persuasive techniques: They
don’t work.


Was the stimulus too subliminal to produce an 
effect? Would longer or more intense exposure have 
affected voting behavior? While not disagreeing 
with his results, some readers may think the au- 
thor’s conclusions premature. 

What kind of evidence is needed to verify a sub- 
liminally stimulated effect? Assume an experimen- 
tal group of respondents exposed subliminally 
(operationally defined as above) to advertising, and 
a control group not exposed. If the two groups do
not differ significantly with regard to claimed
awareness of the stimulus, and if they do differ in 
the expected direction with regard to attitude 
change (using appropriate measures of claimed 
awareness and attitude), then evidence for success- 
ful subliminal persuasion exists. Congress and ad- 
vertisers would probably be equally intrigued by 
such evidence. As far as we know it does not exist. 

—THORNTON C. LOCKWOOD


Attitudes Are
Answers to Questions


Hugh Hardy. Study Shows Public Attitude Toward Ad- 
vertising is Surprisingly Good. Advertising Age, 
Vol. 31, No. 49, December 5, 1960, pp. 77-82. 


Last July the Politz bi-monthly Omnibus at- 
tempted to measure public attitudes toward adver- 
tising. This report of that study can be conveniently 
divided into two parts. 

The first reports answers to: “In general, which 
one of these statements best expresses your own at-
titude toward advertising?” The author “condensed
the seven-point scale of attitudes to three on the
charts for simplicity,” and found 17 per cent “gen-
erally unfavorable.” The results of other questions,
which the author says pertain to various aspects of 
believability, are then taken up. Negative attitudes 
toward advertising expressed in such terms as false, 
deceptive, misleading, untruthful, irritating, dis- 
tasteful and unfavorable are ravelled and unrav- 
elled. This all leads to the lengthier second part of 
the study which is well summarized by the author 
as follows, “. . . if a more definitive study were to 
be made, it should make an effort to separate ques- 
tions of taste, irritability, and believability from 
the question of deception.” 

Mr. Hardy labels most antagonists of advertising 
as “intellectuals.” For convenience, he lumps into 
this category anyone who expresses negative atti- 
tudes toward advertising. Some readers may sympa- 
thize with his motives but disagree with his defini- 
tion. 

The author’s main conclusion is built around the 
17 per cent reported to be “generally unfavorable.” 
But some readers may ponder the author's conten- 
tion that the study “was geared to negative 
responses . . . intentionally [to obtain] findings 
[with] a virtual maximum of criticism.” Why this
approach was desired is unclear, since the author 
himself says that “nowhere did we see any surveys 
that could be considered a reliable and objective 
expression of public likes and dislikes.” Whether
this objective was fully implemented is another
matter; the only question reported (see above) does
not appear to be deliberately constructed to pro-
duce a “maximum” of negative answers. Perhaps
something else in the questionnaire produced the 
desired effect. 

Those who carefully read this article will realize 
that measuring attitudes toward advertising is an 
extremely complex business. To them Mr. Hardy 
has done a real service. It is unfortunate that a 
more appropriate title and introduction weren't 
used to put across this most important result of the
survey.


For attitude measures are rarely useful by them- 
selves. The difficulty of using them increases when 
the relationship between attitudes and behavior is 
ignored. Mr. Hardy’s study has shown some differ-
ences between the attitudes toward advertising of
various subgroups in the sample. But is advertising
also less effective with men, post-graduates and peo-
ple 65 years of age and older? Are people who claim
to have seen false or misleading advertising less in- 
fluenced by advertising? Do respondents who recall 
the product class and specific brand involved in 
their claim of false or misleading advertising never 
buy these products and brands? Clearly the adver- 
tiser could use answers to such questions. With 
them he probably would be more confident of han- 
dling the political and social issues advertising 
involves. 

In the absence of behavioral validation, measures 
of public attitudes toward advertising may provide
some value by being asked repeatedly over time. 
This would at least reveal trends in answers to the 
questions asked. 

The author is to be commended for tackling the 
attitudes-toward-advertising tiger. At the conclu-
sion, however, the issue of who has swallowed whom
is still in doubt. —THORNTON C. LOCKWOOD


If at First You 
Don’t Succeed, Wait 


Schwerin Research Corporation. When Should the Ef- 
fects of Television Advertising be Measured? Part 
2: Changes in Attitude and Behavior. Technical and 
Analytical Review, No. 5, Summer 1960, pp. 1-14. 


In advertising research, attitudes are frequently 
distinguished from behavior by the techniques used 
to measure each. An attitude is an indirect and sub-
jective measure in that it requires a verbal or writ-
ten report by the individual. Behavior may be
measured directly and objectively, with limited, if
any, reporting by the individual. If the reader ap- 
plies these definitions to the title of this pamphlet, 
he will find no evidence of behavior changes result- 
ing from television advertising. 

What we have instead is a description of seven
studies, each of which provides evidence of attitude
changes resulting from various kinds of communica-
tions. Four of the projects concern political com-
munications quite remote from advertising, and
three concern television commercials. The former
are drawn from professional journals and present
evidence of the persistence of attitude change, the
sleeper (delayed) effect resulting from low commu-
nicator credibility, the effect of reinstatement of
the communicator on the sleeper effect, and the re-
inforcement of attitude change resulting from repe-
tition.

This beginning is well written and interestingly
presented. The author is ready to take up television
advertising. Perhaps his warning that “these data
are not structured either to confirm or deny the
hypotheses suggested by the academic data” ade-
quately prepares some readers for the subsequent
let-down.

Interpreting Schwerin Research Corporation
data, the Review states that a TV commercial
“wears out” when it motivates a smaller share of
the audience to want the advertised brand, as
measured by the competitive preference technique.
Competitive preference increments are reported for
31 anonymous products after various periods of ad-
vertising. This brief consideration of wearing out
suggests that the advertiser is interested only in the
increment, and not the level, of brand preference
at different points in time. Many advertisers would
probably have been interested in finding out
whether levels as well as increments of brand pref-
erence differed significantly.

The author next discusses the sleeper effect of
television advertising, and presents data which sug-
gest that seasonal influences and intervening cam-
paigns may cause this effect on attitude change.
The key word here is “suggest.” Inadequate con-
trol of frequency of exposure to commercials, the
probable noncomparability of samples, and the
variation in time intervals preclude any general
conclusions.


One value of this pamphlet should not be over-
looked. In advertising research, where security
precautions often seem excessive, re-
search proposal writers suffer from the lack of ex-
posure to the thoughts and experiences of other
practitioners. With the excellent background pro-
vided by this pamphlet, some readers may be stimu-
lated to answer the author’s plea for “structured
research [in order to examine thoroughly] the oc-
currence of a delayed increase in commercial
effectiveness.”

But consolidating findings, however laudable in 
intent, is not always useful. It is difficult to inter- 
pret differing results, obtained by using different 
measuring devices and administering different mes-
sages to people with different predispositions. The
author proposes an uneasy marriage between aca-
demic and commercial research efforts as a way of
determining study objectives. The reviewer sin-
cerely hopes research therapy will save this mar-
riage.

—THORNTON C. LOCKWOOD


Necessary But
Insufficient


Joseph T. Klapper. The Effects of Mass Communication. 
Glencoe, Illinois: The Free Press, 1960. $5.00. 


Mr. Klapper has collected published findings and 
provocative conjecture concerning the social and 
psychological effects of mass communication. He 
has reviewed some 1,000 studies, essays and reports 
and selected over 270 to include in his book. 

They are organized around five generalizations: 


1. Mass communication ordinarily does not serve as a nec- 
essary and sufficient cause of audience effects, but rather 
functions among and through a nexus of mediating factors 
and influences. 

2. These mediating factors are such that they typically 
render mass communication a contributory agent, but not
the sole cause, in a process of reinforcing the existing con- 
ditions. 

3. On such occasions as mass communication does function 
in the service of change, one of two conditions is likely to 
exist. Either: (a) the mediating factors will be found to be 
inoperative and the effect of media will be found to be di- 
rect; or (b) the mediating factors, which normally favor rein- 
forcement, will be found to be themselves impelling toward 
change. 

4. There are certain residual situations in which mass 
communication seems to produce direct effects, or directly 
and of itself to serve certain psychophysical functions. 

5. The efficacy of mass communication is affected by vari- 
ous aspects of the media and communications themselves or 
of the communication situation. 


The first part of the book is concerned with mass 
communication as an agent of persuasion, investi- 
gating its effect as a reinforcement agent, its effi- 
ciency in creating opinions and attitudes on new 
issues, and its ability as a conversion agent. The 
second half of the book deals with research on the 
effects of certain specific media content, e.g., media
treatment of crime and violence, the effects of
“escapist” material, the effects of adult television 
programs on child audiences, and finally, the al- 
leged dangers of the passive occupation of media 
attendance. After each chapter the author sum- 
marizes the findings reported. 

The problem of organizing and summarizing 
270 studies and of making the finished product 
readable is a difficult one which has been success- 
fully solved by Mr. Klapper. He includes the work 
carried out in Great Britain on the effects of tele- 
vision, and he elaborates on several areas of com- 
munication research he only briefly touched on in 
an earlier work of the same title. 

In reading the book, it becomes obvious that the 
author is well acquainted with the studies. From 
time to time, but all too rarely, he draws on his 
experience and injects his own thoughts on some 
aspect of the subject. 

For the most part, Mr. Klapper draws heavily on 
the views of others. This reviewer regrets that we 
did not have more of Mr. Klapper, for his insights 
and interpretations are among the most stimulating 
passages of the book. This reviewer hopes he will 
find it possible to contribute a further work which 
will emphasize his own theories. As it is, this book 
will be an asset for all who are interested in mass
communication. —GEORGE M. SHIREY


Something Old, 
Something New 


Walter Gruen. Preference for New Products and Its Re- 
lationship to Different Measures of Conformity. 
Journal of Applied Psychology, Vol. 44, No. 6, De- 
cember 1960, pp. 361-364. 

The middle class American seeks the approval of 
his peers by keeping up-to-date. So say Whyte, 
Fromm and Riesman, and the author of this study 
fears that as a result the advertiser may stress “too 
frequently the newness of his product as a sales ap- 
peal.” He is not sure that the encouragement of 
such “subjective obsolescence” is entirely justified. 

The author translates his concern into a specific 
hypothesis: “some people prefer new styles and new 
exteriors regardless of the type of product.” His ex- 
perimental design further translates “some people” 
as 114 college psychology students, “newness” as two 
ranges of years, 1950-56 vs. 1957-59, and “product” 
as foreign cars, silverware patterns, dinnerware, 
spring suit ensembles, dresses, living room furni- 
ture, office furniture, project house models, and 
labels from canned soup and instant coffee. 


Three fairly evenly matched pictures were se-
lected to represent each product and made into
slides. Two were of the same brand with one carry-
ing an old label (a year between 1950-56) and one a
new label (a year between 1957-59). These intervals
were varied purposefully from three to seven years
to help disguise the study’s purpose (presumably
from the respondent). For each pair of labelled
slides, the years were reversed for half the respond-
ents. The third slide was of a different brand with
no year label. Appropriate refreshments were served
during the presentation of soup and coffee slides to
provide grounds for judging these two items. Re-
spondents were carefully instructed to “judge each
object on the slide, or each food sample according
to how you feel about it now . . . not what you
may know or think about the object.”

Rather than test the hypothesis directly by ask-
ing which picture was preferred, the author used
the semantic differential scales (i.e., good-bad, 
beautiful-ugly, clean-dirty and pleasant-unpleasant) 
Osgood found to be most closely associated with his 
evaluative factor. Although the author states that 
the semantic differential was used to obtain a more 
sensitive measure of the reaction than is possible 
by asking for preference, 58 per cent of the sample 
showed no particular preference for either the 
“old” or the “new” products. 

Evaluation scores of each respondent’s predispo-
sition toward newness were determined for each of
the ten product categories. In determining these
scores and analyzing his results, the author weighted
equally the individual polar adjective scales and
the ten individual products, and applied the same 
ten-year scale of newness to all products. Many 
readers will believe that some justification for this 
weighting should have been provided. 

Four additional tests of conformity were adminis-
tered to the subject: Gross’s other- vs. inner-directed
scale, Moeller and Applezweig’s social approval or
self-approval scale, agreement-disagreement on an
American core culture inventory, and Asch’s scale
of agreement or disagreement with an electrical
stooge’s decision. The author found no significant
relationship among his measured preference for
newness and these four tests.


In summary, the author seems to have taken
Whyte, Fromm and Riesman too literally, too seri-
ously, or both. This reviewer, for one, is hard put
to detect subjective obsolescence in, say, silverware
patterns from 1950 to 1959.
—THORNTON C. LOCKWOOD


PUBLICATIONS RECEIVED


MILDRED CANTRELL
ARF Librarian


The ARF Library is proving to be a valuable in-
formation center. It contains an outstanding and
in many ways exclusive collection of reports and
clippings on research concerned with advertising,
marketing, media, copy, motivation and sales
effectiveness. Since its establishment in 1953, the
collection has grown rapidly. With this growth has
come a tremendous increase in usage by Foundation
members. Non-members, who are becoming more aware of
our highly specialized and extensive collection of
materials on advertising research and related topics,
are turning to us for this often difficult-to-find in-
formation. These requests are filled whenever pos-
sible, but preference is given to members in all cases.

If you wish to use the library, please call to find 
out if the material you need is available. If it is, an 
appointment to visit can be made. 

Books, research reports or ARF publications are 
also available through inter-library loans. These
loans should be requested in writing.

Acquisitions since the last issue of this Journal 
(December 1960) are listed below. Permanent copies 
can be obtained directly from the publisher; loan 
copies are available to our members. 


Anns, John, Who’s Who in Market Research. Lausanne, Switz- 
erland: John Anns Publishing Company, 1960. 244 pp. 
$10.00. 


Brady, Dorothy and Eleanor M. Snyder, Relation of Family 
Status to Income in the Labor Force, New York State, 1956. 
New York: New York State Department of Labor, January 
1960. 19 pp. Free. 


Britt, Steuart Henderson, The Spenders. New York: McGraw- 
Hill Book Company, Inc., 1960. 293 pp. $4.95. 


Bureau of Advertising, Newspapers at Your Service. New 
York: Bureau of Advertising, 1960-61. 35 pp. Free. 


Dahl, Robert, Mason Haire and Paul Lazarsfeld, Social Sci- 
ence Research on Business, New York: Columbia University 
Press. 1959. 185 pp. $3.00. 


Deutschmann, Paul J., Newspaper Content of Twelve Metro-
politan Dailies. Cincinnati: Scripps-Howard Research, Sep-
tember 1959. 156 pp. $4.00.


Festinger, Leon, A Theory of Cognitive Dissonance. Evanston,
Illinois: Row, Peterson and Company, 1957. 291 pp. $5.25.


Hancock, Robert S. (Editor), Dynamic Marketing for a Chang-
ing World. Chicago: American Marketing Association, 1960.
570 pp. $5.00 to AMA members, $7.50 to non-members. 


Judkins, Jay, Directory of National Associations of Business- 
men, 1960. Washington, D. C.: U. S. Government Printing 
Office, November 1960. 81 pp. $.50. 


Klapper, Joseph T., The Effects of Mass Communication. 
Glencoe, Illinois: The Free Press, 1960. 302 pp. $5.00.


Klein, B. and Company, Guide to American Directories. New


Agostini’s Constant


To laymen the world of science must seem popu- 
lated mainly by things. Microscopes and telescopes, 
cyclotrons and Univacs, bathyspheres and bubble 
chambers—these are the heroes of popular scientific 
writing. The scientist rarely upstages the machines 
he uses or invents. When he does, it may be only as 
the user or inventor, one supernumerary member 
of a robot cast of thousands. 

Many a scientist is probably reconciled to this 
state of affairs. He may comfort himself with the 
thought that among his colleagues, at least, there 
exists a time-honored distinction between tech- 
nology and science, with the latter a rather more 
prestigious activity. From his standpoint, the ma- 
chines are merely means to an end. 

But this poses an interesting problem to the sci- 
entist human enough to want public acclaim. How 
can he become known to his fellow citizens? If he 
invents a gadget, any resulting fame is bought at 
the price of perpetuating the popular notion of sci- 
ence as subordinate to technology. He would prefer 
to be associated with his goal, not his tools. 

This goal is nothing less than the extension of 
human knowledge. The difficulty is that such 
knowledge seldom comes neatly packaged. Instead 
it is accumulated in darts and rushes, a hunch here, 
a fact there. Once in a great while he may be able 
to report a reliable empirical law, a tested and 
proven theory, a parameter stable enough to be 
called a constant. All too often it goes unnamed.

We wonder if, when Archimedes leapt from his 
bath crying “Eureka!” he knew he had found not 
only the law of specific gravity but the scientist’s 
path to fame as well. Since his day, few others have 
enjoyed the exquisite thrill of seeing their names 
similarly immortalized: Mendel’s Law, Einstein's 
Theory, Planck’s Constant—there are not many. 

If such fame is rare in the physical and biological 
sciences, it is virtually nonexistent in the infant 
science of marketing. In our experience, Reilly's 
Law of Retail Gravitation is the only case, and 
Reilly himself has modestly disclaimed the scope 
and precision implied by the analogy. We would 
prefer to attribute this lack to similar modesty by
other discoverers rather than to the absence of find-
ings of permanent value.


So it is with personal pleasure and vicarious pride
that we suggest a namer of knowledge from adver-
tising research. He is Jean-Michel Agostini and he
reports his finding on pages 11-14. Briefly, he has
discovered that the unduplicated audience of three
or more magazines can be accurately estimated from
the total audience of those magazines and their
duplications in pairs. He represents this relation-
ship by a simple equation containing only one
parameter; this parameter equals 1.125 for several
combinations of both French and U. S. magazines.
Now it is much too early to regard this finding as
any more general than M. Agostini carefully reports
it. Even in the physical sciences, constants have a
way of changing their values as their fields of ap-
plicability are extended. But it is perhaps not too
early to draw some morals from this finding.
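
For readers who want to try the relationship, a minimal sketch follows. The editorial does not print the equation; the form used here, C = A / (1 + K·D/A) with K = 1.125, is the one usually attributed to Agostini's article and should be checked against pages 11-14 before use. A is the sum of the individual audiences, D the sum of the pair-wise duplications, and C the estimated unduplicated audience. The function name and the audience figures are hypothetical.

```python
def agostini_unduplicated(audiences, pair_duplications, k=1.125):
    """Estimate the unduplicated audience C from totals and pair-wise overlaps.

    Assumed form (not printed in the editorial): C = A / (1 + k * D / A).
    """
    total = sum(audiences)                 # A: sum of individual audiences
    duplicated = sum(pair_duplications)    # D: sum of pair-wise duplications
    if total <= 0:
        raise ValueError("total audience must be positive")
    return total / (1.0 + k * duplicated / total)


# Three hypothetical magazines (audiences in thousands) and their three pair overlaps.
estimate = agostini_unduplicated([100, 80, 60], [20, 15, 10])
no_overlap = agostini_unduplicated([100, 80, 60], [0, 0, 0])
```

With no duplication the estimate equals the total audience, as it should; with the overlaps shown it falls to a little over 198 thousand. Only the single audiences and the pairs are needed, which is the economy the finding offers.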


Most obvious is that mathematical models save
time and money. If one obtains simultaneously the
total and pair-wise duplicated audiences of a large
number of publications, then tabulations of the
thousands of higher-order combinations may not
need to be done. A few combinations will suffice to
determine the generality of Agostini’s Constant. If
it holds, work is eliminated. If it does not, we will
still have a sensitive dependent variable on which
to observe the possible influence of other factors.


Second, to name this finding will remind us of
the proper roles in science of knowledge, men and
things. Knowledge is the goal, and when found it
should be named as readily as an inventor christens
his brainchild, or an advertiser his new brand. A
man had the wit to find it, so his should be its
name. Machines were merely the means, and shall
remain nameless.


Third, and perhaps most vital in the long run, a
finding called constant will stimulate further re-
search. The very word both gratifies and challenges
the scientist. He knows that an hypothesis has been
stated according to the only criterion of a good
hypothesis: that it be testable. The lists are open
and the gauntlet is down.
—CHARLES K. RAMOND




